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(57) Abstract 

A set of contiguous and partially overlapping RNA sequences and polypeptides encoded thereby, designated as LUI03 and transcribed 
from lung tissue is described. A fully sequenced clone representing the longest continuous sequence of LU103 is also disclosed. These 
sequences are useful for detecting, diagnosing, staging, monitoring, prognosticating, preventing or treating, or determining the predisposition 
of an individual to diseases and conditions of the lung such as lung cancer. 
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Background of the Invention 

The invention relates generally to detecting diseases of the lung. More 
particularly, the invention relates to reagents such as polynucleotide sequences and 
the polypeptide sequences encoded thereby, as well as methods which utilize these 
sequences. The polynucleotide and polypeptide sequences are useful for detecting, 
diagnosing, staging, monitoring, prognosticating, preventing or treating, or 
determining predisposition to diseases or conditions of the lung such as lung cancer. 

Lung cancer is the second most common cancer for both men and women in 
the United States, with an estimated 178,100 newly diagnosed during 1997 
(American Cancer Society statistics). It also is the most common cause of cancer 
death for both sexes, with over 160,000 lung cancer related deaths expected in 
1997. Lung cancer is a major health problem in other areas of the world, with 
approximately 135,000 new cases occurring each year in the European Union, and 
its incidence rapidly increasing in Central and Eastern Europe. See, Genesis Report . 
February 1995 andT. Reynolds. J. Natl. Cancer Inst . 87: 1348-1349 (1995). 

Early stage lung cancer can be detected by chest radiograph and the sputum 
cytological examination; however, these procedures do not have sufficient sensitivity 
for routine use as screening tests for asymptomatic individuals. Potential technical 
problems which can limit the sensitivity of chest radiograph include suboptimal 
technique, insufficient exposure, and positioning and cooperation of the patient. 
T.G. Tape et al., Ann. Intern. Med. 104: 663-670 (1986). Moreover, radiologists 
often disagree on interpretations of chest radiographs; over 40% of these 
disagreements are significant or potentially significant, with false-negative 
interpretations being the cause of most errors. P.G. Herman ct al., Chest 68: 278- 
282 (1975). Inconclusive results require additional follow-up testing for 
clarification. T.G. Tape et ah, supra . Sputum cytology is even less sensitive than 
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chest radiography in detecting early lung cancer; of 160 lung cancer cases, 
radiography alone detected 123 cases (77%) while cytological examination alone 
detected 67 cases (42%). The National Cancer Institute "Early Lung Cancer 
Detection: Summary and Conclusion/' Am, Rev. Resp. Pis. 130: 565-567 (1984). 
5 Factors affecting the ability of sputum cytological examination to diagnose lung 
cancer include the ability of the patient to produce sufficient sputum, the size of the 
tumor, the proximity of the tumor to major airways, the histologic type of the tumor, 
and the experience and training of the cytopathologist. R.J. Ginsberg et al. In: 
Cancer: Principles and Practice of Oncology . Fourth Edition . V.T. DeVita, S. 
10 Hellman, S.A. Rosenburg, pp. 673-723, Philadelphia, PA: J.B. Lippincott Co. 
(1993). 

A majority of new lung cancers are being detected only when the disease has 
spread beyond the lung. In the United States only 16% of new non-small cell lung 
cancers are detected at a localized stage when 5-year survival is highest (at 49.7%). 

15 In contrast, 68% of new cases are detected when the disease has already spread 

locally (regional disease) or metastasized to distant sites (distant disease) which have 
significantly lower 5-ycar survival rates of only 18.5% and 1.8%, respectively. 
Similarly, 80% of newly detected small-cell lung cancers are discovered with 
regional disease or distant disease, which have 5-year survival rates of only 9.5% 

20 and 1.7%, respectively. Stat Bite. J. Natl. Cancer Inst. 87: 1662, 1995. Thus 

current procedures fail to detect lung cancer at an early, treatable stage of the disease. 
Improved methods of detection therefore are needed to reduce mortality. 

After diagnosis, the patient's cancer is staged. Staging is a strong predictor 
of patient outcome and determines the treatment regimen for the patient. Patients 

25 with cell lung cancer can undergo routine CT scanning of the chest and upper 

abdomen in an effort to detect lymph node metastasis, pulmonary metastases, and 
liver and adrenal metastases. The results of this CT scan frequently are inconclusive 
and lead to additional testing, including bone scans. Staging of patients may also 
include bone scans, fiberoptic bronchoscopy with bronchial washings, in addition to 

30 biopsy and liver function tests. 

The most frequently used methods for monitoring lung cancer patients after 
primary therapy are clinic visits, chest X-rays, complete blood counts, liver function 
tests and chest CT scans. Detecting recurrence by such monitoring techniques, 
however, does not greatly affect mode of treatment and overall survival time. This 

35 leads to the conclusion that current monitoring methods are not cost effective. K.S. 
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Naunheim et al.. Ann. Thnrac. Sure , 60: 1612-1616(1995). G. L. Walsh et aL, 
Ann.Thorac. Surg . 60: 1563-1572 (1995). 

Attempts have been made to discover improved tumor markers for lung 
cancer by first identifying differentially expressed cellular components in lung tumor 
5 tissue compared to normal lung tissue. For example, two-dimensional 

polyacrylamide gel electrophoresis has been used to characterize quantitative and 
qualitative differences in polypeptide composition. T. Hirano et al., Br. J. Cancer 
72: 840-848 (1995); AT. Endler et al., J. Clin. Chem Clin. Biochem. 24:981-992 
(1986). The sensitivity of this technique is limited, however, by the degree of 

1 0 protein resolution of the two electrophoretic steps and by the detection step. This 
step depends on staining protein in gels. The polypeptide instability may generate 
artifacts in the two-dimensional pattern. Another technique, subtractive 
hybridization, has been used to screen for differences in gene expression between 
normal and tumor tissue. P.S. Steeg et al.. J. Natl. Cancer Inst. 80: 200-204 

15 (1988). This technique is laborious and has limitations in detecting mRNA species 
in tissues present in low amounts. A more sensitive method for identifying 
differentially expressed genes is differential display. P. Liang et al., Cancer Res. 
52:6966-6968 (1992). This method involves the reverse transcription of cellular 
mRNAs to cDNAs followed by PCR amplification of a cDNA subpopulation. 

20 Comparison of amplified cDN A subpopulations between normal and tumor lung 

tissues allows identification of mRNA species that are differentially expressed. This 
technique has greater sensitivity than subtractive hybridization for detecting mRNAs 
of low abundance, but is a difficult technique to perform in a routine clinical 
laboratory and therefore is confined to the research setting. A novel gene termed N8 

25 recently was found by differential display to express higher levels of mRNA in lung 
tumor than in normal lung tissue. S.L. Chen et al., Oncogene 12: 741-751 (1996). 
However, no marker currently is available for use in routine screening assay 
techniques, such as immunological assays. Tests based upon the appearance of 
various markers in test samples such as blood, plasma or serum and detectable by 

30 such immunological methods could provide low-cost, non-surgical, diagnostic 

information to aid the physician to make a diagnosis of cancer, help stage a patient, 
select a therapy protocol or monitor the success of the chosen therapy. 

Such markers have been placed into several categories. The first category 
contains those markers which arc elevated in disease. Examples include chorionic 

35 gonadotropin (HCG) which is elevated in testicular cancer and alpha fetoprotein 
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( AFP) which is elevated in hepatocellular carcinoma (HCC). E.L. Jacobs, Curr, 
Prohl. Cancer 15 (6): 299-350 (1991). The second category contains those markers 
which are altered in disease. Examples include splice variants of CD44 in bladder 
cancer Y. Matsumura et aL Journal Pathology 175 (Suppl): 108 A (1995) and 

5 mutations in p53 in lung and colorectal cancer. W.P. Bennett, Cancer Detection and 
Prevention 19 (6): 503-5 1 1 (1995). In the latter case, p53 mutations result in a 
protein which is defective in function and which may or may not be detectable by 
assays based on function or specific antibodies directed against the native protein. 
The third category contains those markers which are normal proteins but which 

10 appear in an inappropriate body compartment. Examples include prostate specific 
antigen (PSA) which is a normal protein secreted at high levels into the seminal 
fluid, but which is present in very low levels in the blood of men with normal 
prostates. P.H. Lange et ah. Urology 33 (6 Suppl): 13 (1989). However, in 
patients with diseases of the prostate, including benign prostatic hyperplasia (BPH) 

1 5 or adenocarcinoma of the prostate, the level of PSA is markedly elevated in the 
blood and is a strong indication of disease of the prostate. Similarly, 
carcinoembryonic antigen (CEA) is a normal component of the inner lining of the 
colon and is present in blood only at low levels in people without diseases of the 
colon. E.L. Jacobs, supra . However, in diseases of the colon including 

20 inflammatory bowel disease and adenocarcinoma of the colon, the concentration of 
CEA is markedly elevated in the blood plasma or serum of many patients and is an 
indicator of disease in the tissue. It also has been recognized that while CEA and 
PSA are produced in some tissues other than the colon or prostate, respectively, 
these markers still are useful in the diagnosis of disease of their primary tissue of 

25 origin due to their strong tissue selectivity. 

There are yet other examples of inappropriate compartmentalization of 
markers. For example, in the case of metastatic cancer, lymph nodes often contain 
cells which have originated from the primary tumor and which often express 
immunohistochemical markers of the primary tumor. CEA and PSA both have been 

30 detected in the lymph nodes of patients with metastisized cancer. Other 

compartments in which the inappropriate appearance of normal gene products are 
indicative of disease include the formed elements of whole blood, which are thought 
to provide evidence of the metastatic spread of the disease. To date, however, no 
such marker for the screening or diagnosis of lung diseases such as lung cancer, 

35 asthma and adult respiratory distress syndrome exists. 
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It therefore would be advantageous to provide methods and reagents for 
detecting, diagnosing, staging, monitoring, prognosticating, preventing or 
treating, or determining the predisposition to diseases and conditions of die lung 
such as lung cancer. Such methods would include assaying a test sample for 
5 products of a gene (or genes) which are overexpressed in diseases and 
conditions associated with lung cancer. Such methods may also include 
assaying a test sample for products of a gene (or genes) which have been altered 
by the diseases and conditions associated with lung cancer. Such methods may 
further include assaying a testsample for products of a gene (or genes) whose 

10 distribution among the various tissues and compartments of the body have been 
altered by the diseases and conditions associated with lung cancer. Such 
methods would comprise making cDNA from mRNA in the test sample, 
amplifying (when necessary) portions of the cDNA corresponding to the gene or 
a fragment thereof, and detecting the cDNA product as an indication of the 

15 presence of the cancer: or detecting translation products of the mRNAs 

comprising the gene sequence(s)as an indication of the presence of the disease. 
These reagents include polynucleotide(s) or fragment(s) thereof which may be 
used in diagnostic methods such as reverse transcriptase-polymerase chain 
reaction (RT-PCR), polymerase chain reaction (PCR), or hybridization assays of 

20 biopsied tissue; polypeptides which are the translation products of such mRNAs; 
or antibodies directed against these proteins. Such methods would include 
assaying a sample for product(s) of the gene and detecting the product(s) as an 
indication of lung cancer. Drug treatment or gene therapy for lung diseases such 
as lung cancer can be based on these identified gene sequences or their expressed 

25 polypeptides, and efficacy of any particular therapy can be monitored using the 
diagnostic methods disclosed herein. Furthermore, it would be advantageous to 
have available alternate diagnostic methods capable of detecting early lung cancer 
in a non-invasive manner. Also of benefit would be methods to stage and 
monitor the treatment of lung disease. 

30 

Summary of the Invention 

The present invention provides a method of detecting a target LU103 
polynucleotide in a test sample which comprises contacting the test sample with at 
least one LUI03-spccific polynucleotide and detecting the presence of the target 
35 LU103 polynucleotide in the test sample. The LU103-specific polynucleotide has at 
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least 50% identity with a polynucleotide selected from the group consisting of 
SEQUENCE ID NO 1 , SEQUENCE ID NO 2, SEQUENCE ID NO 3, SEQUENCE 
ID NO 4, SEQUENCE ID NO 5, and fragments or complements thereof. Also, the 
LU103-specific polynucleotide may be attached to a solid phase prior to performing 
5 the method. 

The present invention also provides a method for detecting LU 103 mRNA in 
a test sample, which comprises performing reverse transcription (RT) with at least 
one primer in order to produce cDN A, amplifying the cDNA so obtained using 
LU103 oligonucleotides as sense and antisense primers to obtain LU103 amplicon, 

10 and detecting the presence of the LU 103 amplicon as an indication of the presence of 
LU103 mRNA in the test sample, wherein the LUI03 oligonucleotides have at least 
50% identity to a sequence selected from the group consisting of SEQUENCE ID 
NO 1 , SEQUENCE ID NO 2, SEQUENCE ID NO 3, SEQUENCE ID NO 4, 
SEQUENCE ID NO 5. and fragments or complements thereof. Amplification can be 

1 5 performed by the polymerase chain reaction. Also, the test sample can be reacted 
with a solid phase prior to performing the method, prior to amplification or prior to 
detection. This reaction can be a direct or an indirect reaction. Further, the detection 
step can comprise utilizing a detectable label capable of generating a measurable 
signal. The detectable label can be attached to a solid phase. 

20 The present invention further provides a method of detecting a target LU 103 

polynucleotide in a test sample suspected of containing target LU103 
polynucleotides ; which comprises (a) contacting the test sample with at least one 
LU103 oligonucleotide as a sense primer and at least one LU103 oligonucleotide as 
an anti-sense primer, and amplifying same to obtain a first stage reaction product; (b) 

25 contacting the first stage reaction product with at least one other LU 103 

oligonucleotide to obtain a second stage reaction product, with the proviso that the 
other LU103 oligonucleotide is located 3* to the LU103 oligonucleotides utilized in 
step (a) and is complementary to the first stage reaction product; and (c) detecting the 
second stage reaction product as an indication of the presence of a target LU103 

30 polynucleotide in the test sample. The LU103 oligonucleotides selected as reagents 
in the method have at least 50% identity to a sequence selected from the group 
consisting of SEQUENCE ID NO 1, SEQUENCE ID NO 2, SEQUENCE ID NO 3, 
SEQUENCE ID NO 4, SEQUENCE ID NO 5, and fragments or complements 
thereof. Amplification may be performed by the polymerase chain reaction. The test 

35 sample can be reacted either directly or indirectly with a solid phase prior to 
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performing the method, or prior to amplification, or prior to deteccion. The detection 
step also comprises utilizing a detectable label capable of generating a measurable 
signal; further, the detectable label can be attached to a solid phase. Test kits useful 
for detecting target LU103 polynucleotides in a test sample are also provided which 

5 comprise a container containing at least one LU 1 03 specific polynucleotide selected 
from the group consisting of SEQUENCE ID NO 1 , SEQUENCE ID NO 2, 
SEQUENCE ID NO 3, SEQUENCE ID NO 4. SEQUENCE ID NO 5, and 
fragments or complements thereof. These test kits further comprise containers with 
tools useful for collecting test samples (such as, for example, blood, urine, saliva 

1 0 and stool). Such tools include lancets and absorbent paper or cloth for collecting and 
stabilizing blood; swabs for collecting and stabilizing saliva; and cups for collecting 
and stabilizing urine or stool samples. Collection materials, such as papers, cloths, 
swabs, cups and the like, may optionally be treated to avoid denaturation or 
irreversible adsorption of the sample. The collection materials also may be treated 

1 5 with or contain preservatives, stabilizers or antimicrobial agents to help maintain the 

integrity of the specimens. 

The present invention provides a purified polynucleotide or fragment thereof 
derived from an LUI03 gene. The purified polynucleotide is capable of selectively 
hybridizing to the nucleic acid of the LU 103 gene, or a complement thereof. The 

20 polynucleotide has at least 50% identity to a polynucleotide selected from the group 
consisting of SEQUENCE ID NO 1, SEQUENCE ID NO 2, SEQUENCE ID NO 3, 
SEQUENCE ID NO 4, SEQUENCE ID NO 5, and fragments or complements 
thereof. Further, the purified polynucleotide can be produced by recombinant and/or 
synthetic techniques. The purified recombinant polynucleotide can be contained 

25 within a recombinant vector. The invention further comprises a host cell transfected 
with said vector. 

The present invention further provides a recombinant expression system 
comprising a nucleic acid sequence that includes an open reading frame derived from 
LU103. The nucleic acid sequence has at least 50% identity with a sequence selected 
30 from the group consisting of SEQUENCE ID NO 1 , SEQUENCE ID NO 2, 
SEQUENCE ID NO 3, SEQUENCE ID NO 4, SEQUENCE ID NO 5, and 
fragments or complements thereof. The nucleic acid sequence is operably linked to a 
control sequence compatible with a desired host. Also provided is a cell transfected 
with this recombinant expression system. 
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The present invention also provides a ceil transfected with an LUI03 nucleic 
acid sequence that encodes at least one epitope of an LU103 antigen, or fragment 
thereof. The nucleic acid sequence is selected from the group consisting of 
SEQUENCE ID NO 1, SEQUENCE ID NO 2, SEQUENCE ID NO 3, SEQUENCE 
5 ID NO 4, SEQUENCE ID NO 5, and fragments or complements thereof. 

Also provided is a composition of matter that comprises an LU103 
polynucleotide of at least about 10-12 nucleotides having at least 50% identity to a 
polynucleotide selected from the group consisting of SEQUENCE ID NO I, 
SEQUENCE ID NO 2, SEQUENCE ID NO 3, SEQUENCE ID NO 4, SEQUENCE 

10 ID NO 5, and fragments or complements thereof. The LU103 polynucleotide 

encodes an amino acid sequence having at least one LU103 epitope. Also provided is 
a gene, or fragment thereof, coding for an LU103 polypeptide which has at least 
50% identity to SEQUENCE ID NO 14; and a gene, or a fragment thereof, 
comprising DNA having at least 50% identity to SEQUENCE ID NO 4 or 

15 SEQUENCE ID NO 5. 

Brief Description of the Drawings 

FIGURE 1 shows the nucleotide alignment of clones 1235095 (SEQUENCE 
ID NO 1), 1235531 (SEQUENCE ID NO 2), 1379417 (SEQUENCE ID NO 3), and 

20 the consensus sequence (SEQUENCE ID NO 4) derived therefrom; 

FIGURE 2 shows the contig map depicting the formation of the consensus 
nucleotide sequence (SEQUENCE ED NO 4) from the nucleotide alignment of 
overlapping clones 1235095 (SEQUENCE ID NO 1), 123553 1 (SEQUENCE ID 
NO 2), and 1379417 (SEQUENCE ID NO 3); 

25 FIGURE 3 A is a scan of an ethidium bromide stained agarose gel of RNA 

from various tissue extracts and the corresponding northern blot of RNA using 
LU103 radiolabeled probe; FIGURE 3B is a scan of an ethidium bromide stained 
agarose gel of RNA from various lung tissues and the corresponding northern blot 
of RNA using LU103 radiolabeled probe; 

30 FIGURE 4 A is a scan of an ethidium bromide stained agarose gel of LU 1 03- 

specific and RNA control-specific primed PCR amplification products from lung 
tissue RNAs; FIGURE 4B is a scan of an ethidium bromide stained agarose gel of 
LU103-specific primed PCR amplification products from RNAs of lung, prostate, 
breast and colon tissues. 
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Detailed Description of the Invention 

The present invention provides a gene or a fragment thereof which codes for 
an LU103 polypeptide having at least about 50% identity to SEQUENCE ID NO 14. 
The present invention further encompasses an LU103 gene or a fragment thereof 
5 comprising DNA which has at least about 50% identity to SEQUENCE ID NO 4 or 
SEQUENCE ID NO 5. 

The present invention provides methods for assaying a test sample for 
products of a lung tissue gene designated as LU103, which comprises making 
cDNA from mRNA in the test sample, and detecting the cDNA as an indication of 

10 the presence of lung tissue gene LU 103. The method may include an amplification 
step, wherein one or more portions of the mRNA from LU103 corresponding to the 
gene or fragments thereof, is amplified. Methods also are provided for assaying for 
the translation products of LU103. Test samples which may be assayed by the 
methods provided herein include tissues, cells, body fluids and secretions. The 

15 present invention also provides reagents such as oligonucleotide primers and 
polypeptides which are useful in performing these methods. 

Portions of the nucleic acid sequences disclosed herein are useful as primers 
for the reverse transcription of RNA or for the amplification of cDN A; or as probes 
to determine the presence of certain mRNA sequences in test samples. Also 

20 disclosed arc nucleic acid sequences which permit the production of encoded 
polypeptide sequences which are useful as standards or reagents in diagnostic 
immunoassays, as targets for pharmaceutical screening assays and/or as components 
or as target sites for various therapies. Monoclonal and polyclonal antibodies 
directed against at least one epitope contained within these polypeptide sequences are 

25 useful as delivery agents for therapeutic agents as well as for diagnostic tests and for 
screening for diseases or conditions associated with LU103, especially lung cancer. 
Isolation of sequences of other portions of the gene of interest can be accomplished 
utilizing probes or PCR primers derived from these nucleic acid sequences. This 
allows additional probes of the mRNA or cDNA of interest to be established, as well 

30 as corresponding encoded polypeptide sequences. These additional molecules are 
useful in detecting, diagnosing, staging, monitoring, prognosticating, preventing or 
treating, or determining the predisposition to, diseases and conditions of the lung 
such as lung cancer, characterized by LU103, as disclosed herein. 

Techniques for determining amino acid sequence "similarity" are well-known 

35 in the art. In general, "similarity" means the exact amino acid to amino acid 
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comparison of two or more polypeptides at the appropriate place, where amino acids 
are identical or possess similar chemical and/or physical properties such as charge or 
hydrophobicity. A so-termed "percent similarity" then can be determined between 
the compared polypeptide sequences. Techniques for determining nucleic acid and 
5 amino acid sequence identity also are well known in the art and include determining 
the nucleotide sequence of the mRNA for that gene (usually via a cDNA 
intermediate) and determining the amino acid sequence encoded thereby, and 
comparing this to a second amino acid sequence. In general, "identity" refers to an 
exact nucleotide to nucleotide or amino acid to amino acid correspondence of two 

10 polynucleotides or polypeptide sequences, respectively. Two or more 

polynucleotide sequences can be compared by determining their "percent identity." 
Two or more amino acid sequences likewise can be compared by determining their 
"percent identity." The programs available in the Wisconsin Sequence Analysis 
Package, Version 8 (available from Genetics Computer Group, Madison, WI), for 

15 example, the GAP program, arc capable of calculating both the identity between two 
polynucleotides and the identity and similarity between two polypeptide sequences, 
respectively. Other programs for calculating identity or similarity between sequences 
are known in the art. 

The compositions and methods described herein will enable the identification 

20 of certain markers as indicative of a lung tissue disease or condition; the information 
obtained therefrom will aid in the detecting, diagnosing, staging, monitoring, 
prognosticating, preventing or treating, or determining diseases or conditions 
associated with LU103, especially lung cancer. Test methods include, for example, 
probe assays which utilize the sequence(s) provided herein and which also may 

25 utilize nucleic acid amplification methods such as the polymerase chain reaction 
(PCR), the ligase chain reaction (LCR), and hybridization. In addition, the 
nucleotide sequences provided herein contain open reading frames from which an 
immunogenic epitope may be found. This epitope is believed to be unique to the 
disease state or condition associated with LU103. It also is thought that the 

30 polynucleotides or polypeptides and protein encoded by the LU103 gene are useful 
as a marker. This marker is either elevated in disease such as lung cancer, altered in 
disease such as lung cancer, or present as a normal protein but appearing in an 
inappropriate body compartment. The uniqueness of the epitope may be determined 
by (i) its immunological reactivity and specificity with antibodies directed against 

35 proteins and polypeptides encoded by the LU103 gene, and (ii) its nonreactivity with 
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any other tissue markers. Methods for determining immunological reactivity are 
well-known and include but are not limited to. for example, radioimmunoassay 
(RIA), enzyme-linked immunosorbent assay (ELISA), hemagglutination (HA), 
fluorescence polarization immunoassay (FPIA), chemiluminescent immunoassay 
5 (CLIA) and others. Several examples of suitable methods arc described herein. 

Unless otherwise stated, the following terms shall have the following 
meanings: 

A polynucleotide "derived from" or "specific for" a designated sequence 
refers to a polynucleotide sequence which comprises a contiguous sequence of 
10 approximately at least about 6 nucleotides, preferably at least about 8 nucleotides, 
more preferably at least about 10-12 nucleotides, and even more preferably at least 
about 15-20 nucleotides corresponding, i.e., identical or complementary to, a region 
of the designated nucleotide sequence. The sequence may be complementary or 
identical to a sequence which is unique to a particular polynucleotide sequence as 

15 determined by techniques known in the art. Comparisons to sequences in 

databanks, for example, can be used as a method to determine the uniqueness of a 
designated sequence. Regions from which sequences may be derived, include bur 
are not limited to, regions encoding specific epitopes, as well as non-translated 
and/or non-transcribed regions. 

20 The derived polynucleotide will not necessarily be derived physically from 

the nucleotide sequence of interest under study, but may be generated in any manner, 
including but not limited to chemical synthesis, replication, reverse transcription or 
transcription, which is based on the information provided by the sequence of bases 
in the region(s) from which the polynucleotide is derived. As such, it may represent 

25 either a sense or an antisense orientation of the original polynucleotide. In addition, 
combinations of regions corresponding to that of the designated sequence may be 
modified in ways known in the art to be consistent with the intended use. 

A "fragment" of a specified polynucleotide refers to a polynucleotide 
sequence which comprises a contiguous sequence of approximately at least about 6 

30 nucleotides, preferably at least about 8 nucleotides, more preferably at least about 
10-12 nucleotides, and even more preferably at least about 15-20 nucleotides 
corresponding, i.e., identical or complementary to, a region of the specified 
nucleotide sequence. 

The term "primer" denotes a specific oligonucleotide sequence which is 

35 complementary to a target nucleotide sequence and used to hybridize to the target 
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nucleotide sequence. A primer serves as an initiation point for nucleotide 
polymerization catalyzed by either DNA polymerase, RNA polymerase or reverse 
transcriptase. 

The term "probe" denotes a defined nucleic acid segment (or nucleotide 
5 analog segment, e.g., PNA as defined hereinbelovv) which can be used to identify a 
specific polynucleotide present in samples bearing the complementary sequence. 

"Encoded by" refers to a nucleic acid sequence which codes for a polypeptide 
sequence, wherein the polypeptide sequence or a portion thereof contains an amino 
acid sequence of at least 3 to 5 amino acids, more preferably at least 8 to 10 amino 
10 acids, and even more preferably at least 1 5 to 20 amino acids from a polypeptide 

encoded by the nucleic acid sequence. Also encompassed are polypeptide sequences 
which are immunologically identifiable with a polypeptide encoded by the sequence. 
Thus, a "polypeptide," "protein," or "amino acid" sequence has at least about 50% 
identity, preferably about 60% identity, more preferably about 75-85% identity, and 
15 most preferably about 90-95% or more identity to an LU103 amino acid sequence. 
Further, the LU103 "polypeptide," "protein," or "amino acid" sequence may have at 
least about 60% similarity, preferably at least about 75% similarity, more preferably 
about 85% similarity, and most preferably about 95% or more similarity to a 
polypeptide or amino acid sequence of LU103. This amino acid sequence can be 
20 selected from the group consisting of SEQUENCE ID NOS 14-19, and fragments 
thereof. 

A "recombinant polypeptide," "recombinant protein," or "a polypeptide 
produced by recombinant techniques," which terms may be used interchangeably 
herein, describes a polypeptide which by virtue of its origin or manipulation is not 
25 associated with all or a portion of the polypeptide with which it is associated in 

nature and/or is linked to a polypeptide other than that to which it is linked in nature. 
A recombinant or encoded polypeptide or protein is not necessarily translated from a 
designated nucleic acid sequence. It also may be generated in any manner, including 
chemical synthesis or expression of a recombinant expression system. 
30 The term "synthetic peptide" as used herein means a polymeric form of 

amino acids of any length, which may be chemically synthesized by methods well- 
known to the routineer. These synthetic peptides arc useful in various applications. 

The term "polynucleotide" as used herein means a polymeric form of 
nucleotides of any length, either ribonucleotides or dcoxyribonucleotides. This term 
35 refers only to the primary structure of the molecule. Thus, the term includes double- 
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and single-stranded DNA, as well as double- and single-stranded RNA. It also 
includes modifications, such as methylation or capping and unmodified forms of the 
polynucleotide. The terms "polynucleotide;' "oligomer," "oligonucleotide/* and 
"oligo" are used interchangeably herein. 
5 "A sequence corresponding to a cDNA" means that the sequence contains a 

polynucleotide sequence that is identical or complementary to a sequence in the 
designated DNA. The degree (or "percent") of identity or complementarity to the 
cDNA will be approximately 50% or greater, preferably at least about 70% or 
greater, and more preferably at least about 90% or greater. The sequence that 

10 corresponds to the identified cDNA will be at least about 50 nucleotides in length, 
preferably at least about 60 nucleotides in length, and more preferably at least about 
70 nucleotides in length. The correspondence between the gene or gene fragment of 
interest and the cDNA can be determined by methods known in the art and include, 
for example, a direct comparison of the sequenced material with the cDNAs 

15 described, or hybridization and digestion with single strand nucleases, followed by 
size determination of the digested fragments. 

"Purified polynucleotide" refers to a polynucleotide of interest or fragment 
thereof which is essentially free, e.g., contains less than about 50%, preferably less 
than about 70%, and more preferably less than about 90%, of the protein with which 

20 the polynucleotide is naturally associated. Techniques for purifying polynucleotides 
of interest are well-known in the art and include, for example, disruption of the cell 
containing the polynucleotide with a chaotropic agent and separation of the 
polynucleotide(s) and proteins by ion-exchange chromatography, affinity 
chromatography and sedimentation according to density. 

25 "Purified polypeptide" or "purified protein" means a polypeptide of interest 

or fragment thereof which is essentially free of, e.g., contains less than about 50%, 
preferably less than about 70%, and more preferably less than about 90%, cellular 
components with which the polypeptide of interest is naturally associated. Methods 
for purifying polypeptides of interest are known in the art. 

30 The term "isolated" means that the material is removed from its original 

environment (e.g., the natural environment if it is naturally occurring). For 
example, a naturally-occurring polynucleotide or polypeptide present in a living 
animal is not isolated, but the same polynucleotide or DNA or polypeptide, which is 
separated from some or all of the coexisting materials in the natural system, is 

35 isolated. Such polynucleotide could be part of a vector and/or such polynucleotide 
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or polypeptide could be part of a composition, and still be isolated in that the vector 
or composition is not part of its natural environment. 

"Polypeptide" and "protein" are used interchangeably herein and indicate at 
least one molecular chain of amino acids linked through covalent and/or non-covalent 
5 bonds. The terms do not refer to a specific length of the product. Thus peptides, 
oligopeptides and proteins arc included within the definition of polypeptide. The 
terms include post-translational modifications of the polypeptide, for example, 
glycosylates, acetylations, phosphorylations and the like. In addition, protein 
fragments, analogs, mutated or variant proteins, fusion proteins and the like are 
10 included within the meaning of polypeptide. 

A "fragment" of a specified polypeptide refers to an amino acid sequence 
which comprises at least about 3-5 amino acids, more preferably at least about 8-10 
amino acids, and even more preferably at least about 15-20 amino acids derived from 
the specified polypeptide. 
15 "Recombinant host cells" "host cells," "cells " "cell lines," "cell cultures" 

and other such terms denoting microorganisms or higher eukaryotic cell lines 
cultured as unicellular entities refer to cells which can be, or have been, used as 
recipients for recombinant vectors or other transferred DNA, and include the original 
progeny of the original cell which has been transfected. 
20 As used herein "replicon" means any genetic element, such as a plasmid, a 

chromosome or a virus, that behaves as an autonomous unit of polynucleotide 
replication within a cell. 

A "vector" is a replicon in which another polynucleotide segment is attached, 
such as to bring about the replication and/or expression of the attached segment. 
25 The term "control sequence" refers to a polynucleotide sequence which is 

necessary to effect the expression of a coding sequence to which it is ligated. The 
nature of such control sequences differs depending upon the host organism. In 
prokaryotes, such control sequences generally include a promoter, a ribosomal 
binding site, and terminators; in eukaryotes, such control sequences generally 
30 include promoters, terminators and, in some instances, enhancers. The term 

"control sequence" thus is intended to include at a minimum all components whose 
presence is necessary for expression, and also may include additional components 
whose presence is advantageous, for example, leader sequences. 

"Operably linked" refers to a situation wherein the components described are 
35 in a relationship permitting them to function in their intended manner. Thus, for 
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example, a control sequence "opcrably linked" to a coding sequence is ligatcd in 
such a manner that expression of the coding sequence is achieved under conditions 
compatible with the control sequence. 

The term "open reading frame" or "ORF" refers to a region of a 
5 polynucleotide sequence which encodes a polypeptide. This region may represent a 
portion of a coding sequence or a total coding sequence. 

A "coding sequence" is a polynucleotide sequence which is transcribed into 
mRNA and translated into a polypeptide when placed under the control of 
appropriate regulatory sequences. The boundaries of the coding sequence are 

1 0 determined by a translation start codon at the 5' -terminus and a translation stop 
codon at the 3' -terminus. A coding sequence can include, but is not limited to, 
mRNA, cDNA and recombinant polynucleotide sequences. 

The term "immunologically identifiable with/as" refers to the presence of 
epitope(s) and polypeptide(s) which also are present in and are unique to the 

15 designated polypeptide(s). Immunological identity may be determined by antibody 
binding and/or competition in binding. These techniques are known to the routineer 
and also are described herein. The uniqueness of an epitope also can be determined 
by computer searches of known data banks, such as GenBank, for the 
polynucleotide sequence which encodes the epitope and by amino acid sequence 

20 comparisons with other known proteins. 

As used herein, "epitope" means an antigenic determinant of a polypeptide or 
protein. Conceivably, an epitope can comprise three amino acids in a spatial 
conformation which is unique to the epitope. Generally, an epitope consists of at 
least five such amino acids and more usually, it consists of at least eight to ten amino 

25 acids. Methods of examining spatial conformation are known in the art and include, 
for example, x-ray crystallography and two-dimensional nuclear magnetic 
resonance. 

A "conformational epitope" is an epitope that is comprised of specific 
juxtaposition of amino acids in an immunologically recognizable structure, such 
30 amino acids being present on the same polypeptide in a contiguous or non- 
contiguous order or present on different polypeptides. 

A polypeptide is "immunologically reactive" with an antibody when it binds 
to an antibody due to antibody recognition of a specific epitope contained within the 
polypeptide. Immunological reactivity may be determined by antibody binding, 
35 more particularly, by the kinetics of antibody binding, and/or by competition in 
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binding using as competiior(s) a known poiypeptide(s) containing an epitope against 
which the antibody is directed. The methods for determining whether a polypeptide 
is immunologically reactive with an antibody are known in the art. 

As used herein, the term "immunogenic polypeptide containing an epitope of 

5 interest" means naturally occurring polypeptides of interest or fragments thereof, as 
well as polypeptides prepared by other means, for example, by chemical synthesis or 
the expression of the polypeptide in a recombinant organism. 

The term "transfection" refers to the introduction of an exogenous 
polynucleotide into a prokaryotic or eucaryotic host cell, irrespective of the method 

10 used for the introduction. The term "transfection" refers to both stable and transient 
introduction of the polynucleotide, and encompasses direct uptake of 
polynucleotides, transformation, transduction, and f-mating. Once introduced into 
the host cell, the exogenous polynucleotide may be maintained as a non-integrated 
replicon, for example, a plasmid, or alternatively, may be integrated into the host 

15 genome. 

"Treatment" refers to prophylaxis and/or therapy. 
The term "individual" as used herein refers to vertebrates, particularly 
members of the mammalian species and includes, but is not limited to, domestic 
animals, sports animals, primates and humans; more particularly the term refers to 

20 humans. 

The term "sense strand" or "plus strand" (or "+") as used herein denotes a 
nucleic acid that contains the sequence that encodes the polypeptide. The term 
"antisense strand" or "minus strand" (or "-") denotes a nucleic acid that contains a 
sequence that is complementary to that of the "plus" strand. 

25 The term "test sample" refers to a component of an individual's body which 

is the source of the analyte (such as, antibodies of interest or antigens of interest). 
These components are well known in the art. A test sample is typically anything 
suspected of containing a target sequence. Test samples can be prepared using 
methodologies well known in the art such as by obtaining a specimen from an 

30 individual and, if necessary, disrupting any cells contained thereby to release target 
nucleic acids. These test samples include biological samples which can be tested by 
the methods of the present invention described herein and include human and animal 
body fluids such as whole blood, scrum, plasma, cerebrospinal fluid, sputum, 
bronchial washing, bronchial aspirates, urine, lymph fluids and various external 

35 secretions of the respiratory, intestinal and genitourinary tracts, tears, saliva, milk, 
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white blood cells, myelomas and the like; biological fluids such as cell culture 
supernatams; tissue specimens which may be fixed; and cell specimens which may 
be fixed. 

"Purified product" refers to a preparation of the product which has been 

5 isolated from the cellular constituents with which the product is normally associated 
and from other types of cells which may be present in the sample of interest. 

"PNA" denotes a "peptide nucleic acid analog" which may be utilized in a 
procedure such as an assay described herein to determine the presence of a target. 
"MA" denotes a "morpholino analog" which may be utilized in a procedure such as 

10 an assay described herein to determine the presence of a target. See, for example, 
U.S. Patent No. 5,378,841 . PNAs arc neutrally charged moieties which can be 
directed against RN A targets or DNA. PNA probes used in assays in place of, for 
example, the DNA probes of the present invention, offer advantages not achievable 
when DNA probes are used. These advantages include manufacturability, large 

15 scale labeling, reproducibility, stability, insensitivity to changes in ionic strength and 
resistance to enzymatic degradation which is present in methods utilizing DNA or 
RNA. These PNAs can be labeled with ("attached to") such signal generating 
compounds as fluorescein, radionucleotides, chemiluminescent compounds and the 
like. PNAs or other nucleic acid analogs such as MAs thus can be used in assay 

20 methods in place of DNA or RNA. Although assays are described herein utilizing 
DNA probes, it is within the scope of the routineer that PNAs or MAs can be 
substituted for RNA or DNA with appropriate changes if and as needed in assay 
reagents. 

"Analytc," as used herein, is the substance to be detected which may be 
25 present in the test sample. The analyte can be any substance for which there exists a 
naturally occurring specific binding member (such as, an antibody), or for which a 
specific binding member can be prepared. Thus, an analyte is a substance that can 
bind to one or more specific binding members in an assay. "Analyte" also includes 
any antigenic substances, haptens, antibodies and combinations thereof. As a 
30 member of a specific binding pair, the analyte can be detected by means of naturally 
occurring specific binding partners (pairs) such as the use of intrinsic factor protein 
as a member of a specific binding pair for the determination of Vitamin B 12, the use 
of folate-binding protein to determine folic acid, or the use of a lectin as a member of 
a specific binding pair for the determination of a carbohydrate. The analyte can 
35 include a protein, a polypeptide, an amino acid, a nucleotide target and the like. 
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"Diseases of the lung/* or 'inng diseasi 
herein, refer to any disease or condition of the I 
not limited to, pneumonia (of all origins, indue 
asthma, black lung disease, silicosis, adult resp : 
cancer. 



"condition of the lung/' as used 
respiratory tract including, but 
iral, bacterial, and fungal), 
ry distress syndrome, and 



malignant disease of the lower 
:ell carcinoma, adenocarcinoma, 
Lung cancers are frequently 
.!1 carcinomas. 

Mcrs to the partial sequence of a 



i giving the quantitative distribution 
in the tissue from which the 

h utilize specific binding 
.crcin, is a member of a specific 



"Lung cancer," as used herein, refers to 

respiratory tract including, but not limited to, sir 

squamous cell carcinoma, and large cell carcino) 

grouped into small cell carcinoma and non-smal; 
10 An "Expressed Sequence Tag" or "EST" 

cDNA insert which has been made by reverse tr; iscription of mRNA extracted from 

a tissue followed by insertion into a vector. 

A "transcript image" refers to a table or 1 

of ESTs in a library and represents the genes act 
15 library was made. 

The present invention provides assays w 

members. A "specific binding member," as usc< 

binding pair. That is, two different molecules u r, _-rc one of the molecules, through 

chemical or physical means, specifically binds to the second molecule. Therefore, in 
20 addition to antigen and antibody specific binding pairs of common immunoassays, 

other specific binding pairs can include biotin and avidin, carbohydrates and lectins, 

complementary nucleotide sequences, effector and receptor molecules, cofactors and 

enzymes, enzyme inhibitors and enzymes and ih: like. Furthermore, specific 

binding pairs can include members that arc anaic :s of the original specific binding 
25 members, for example, an analytc-analog. Imin \ueactive specific binding 

members include antigens, antigen fragments, a; bodies and antibody fragments, 

both monoclonal and polyclonal and complexes 

recombinant DNA molecules. 

The term "hapten," as used herein, refer? 
30 binding member which is capable of binding to 

of eliciting antibody formation unless coupled t: 
A "capture reagent," as used herein, refc 

member which is specific either for the analyte ; 
indicator reagent or analyte as in a competitive 
35 binding member, which itself is specific for the 



jreof, including those formed by 

a partial antigen or non-protein 
antibody, but which is not capable 
carrier protein. 

,o an unlabeled specific binding 
:i a sandwich assay, for the 
y. or for an ancillary specific 
ulyte, as in an indirect assay. The 



WO 98/20143 



PCTAJS97/20680 



capture i w „:ii can be directly or indirectly bound to a solid phase material before the 
performs of the assay or during the performance of the assay, thereby enabling 
the sepan ;i ;;i of immobilized complexes from the test sample. 

Ti c "indicator reagent" comprises a "signal-generating compound" ("label") 
5 which is c . :; ;ble of generating and generates a measurable signal detectable by 

external means, conjugated ("attached") to a specific binding member. In addition to 
being an ninbody member of a specific binding pair, the indicator reagent also can 
be a member of any specific binding pair, including either hapten-anti-hapten 
systems such as biotin or anti-biotin, avidin or biotin, a carbohydrate or a lectin, a 

1 0 complementary nucleotide sequence, an effector or a receptor molecule, an enzyme 
cofacior and an enzyme, an enzyme inhibitor or an enzyme and the like. An 
immunoreactive specific binding member can be an antibody, an antigen, or an 
antibody /ami gen complex that is capable of binding either to the polypeptide of 
interest as in a sandwich assay, to the capture reagent as in a competitive assay, or to 

15 the ancillary specific binding member as in an indirect assay. When describing 
probes and probe assays, the term "reporter molecule" may be used. A reporter 
molecule comprises a signal generating compound as described hereinabove 
conjugated to a specific binding member of a specific binding pair, such as carbazole 
or adamantane. 

20 The various "signal-generating compounds" (labels) contemplated include 

chromagens, catalysts such as enzymes, luminescent compounds such as fluorescein 
and rhodamine, chemiluminescent compounds such as dioxetanes, acridiniums, 
phenanthridiniums and luminol, radioactive elements and direct visual labels. 
Examples of enzymes include alkaline phosphatase, horseradish peroxidase, beta- 
25 galactosidasc and the like. The selection of a particular label is not critical, but it 
must be capable of producing a signal either by itself or in conjunction with one or 
more additional substances. 

"Solid phases" ("solid supports") are known to those in the art and include 
the walls of wells of a reaction tray, test tubes, polystyrene beads, magnetic or non- 
30 magnetic beads, nitrocellulose strips, membranes, microparticles such as latex 
particles, sheep (or other animal) red blood cells and Duracytes® (red blood cells 
"fixed" by pyruvic aldehyde and formaldehyde, available from Abbott Laboratories, 
Abbott Park, DL) and others. The "solid phase" is not critical and can be selected by 
one skilled in the art. Thus, latex particles, microparticles, magnetic or non- 
35 magnetic beads, membranes, plastic lubes, walls of microliter wells, glass or silicon 
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chips, sheep (or other suitable animal's) red blood cells and Duracytes ,jy arc all 
suitable examples. Suitable methods for immobilizing peptides on solid phases 
include ionic, hydrophobic, covalent interactions and the like. A "solid phase," as 
used herein, refers to any material which is insoluble, or can be made insoluble by a 
5 subsequent reaction. The solid phase can be chosen for its intrinsic ability to attract 
and immobilize the capture reagent. Alternatively, the solid phase can retain an 
additional receptor which has the ability to attract and immobilize the capture reagent. 
The additional receptor can include a charged substance that is oppositely charged 
with respect to the capture reagent itself or to a charged substance conjugated to the 

10 capture reagent. As yet another alternative, the receptor molecule can be any specific 
binding member which is immobilized upon (attached to) the solid phase and which 
has the ability to immobilize the capture reagent through a specific binding reaction. 
The receptor molecule enables the indirect binding of the capture reagent to a solid 
phase material before the performance of the assay or during the performance of the 

15 assay. The solid phase thus can be a plastic, derivatized plastic, magnetic or non- 
magnetic metal, glass or silicon surface of a test tube, microtiter well, sheet, bead, 
microparticle, chip, sheep (or other suitable animal's) red blood cells, Duracytes® 
and other configurations known to those of ordinary skill in the art. 

It is contemplated and within the scope of the present invention that the solid 

20 phase also can comprise any suitable porous material with sufficient porosity to 

allow access by detection antibodies and a suitable surface affinity to bind antigens. 
Microporous structures generally arc preferred, but materials with a gel structure in 
the hydrated state may be used as well. Such useful solid supports include, but are 
not limited to, nitrocellulose and nylon. It is contemplated that such porous solid 

25 supports described herein preferably are in the form of sheets of thickness from 
about 0.01 to 0.5 mm, preferably about 0.1 mm. The pore size may vary within 
wide limits and preferably is from about 0.025 to 15 microns, especially from about 
0.15 to 15 microns. The surface of such supports may be activated by chemical 
processes which cause covalent linkage of the antigen or antibody to the support. 

30 The irreversible binding of the antigen or antibody is obtained, however, in general, 
by adsorption on the porous material by poorly understood hydrophobic forces. 
Other suitable solid supports are known in the art. 
Reagents 

The present invention provides reagents such as polynucleotide sequences 
35 derived from a lung tissue of interest and designated as LU 1 03, polypeptides 
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encoded thereby and antibodies specific for these polype; ; Jes. The present 
invention also provides reagents such as oligonucleotide ^ments derived from the 
disclosed polynucleotides and nucleic acid sequences corrpiementary to these 
polynucleotides. The polynucleotides, polypeptides, or axibodies of the present 
5 invention may be used to provide information leading to ■ :i • detecting, diagnosing, 
staging, monitoring, prognosticating, preventing or treating of, or determining the 
predisposition to, diseases and conditions of the lung, sut;h as lung cancer. The 
sequences disclosed herein represent unique polynucleotides which can be used in 
assays or for producing a specific profile of gene transcription activity. Such assays 

10 are disclosed in European Patent Number 0373203B1 and International Publication 
No. WO 95/11995. 

Selected LU103-derived polynucleotides can be used in the methods 
described herein for the detection of normal or altered gene expression. Such 
methods may employ LU103 polynucleotides or oligonucleotides, fragments or 

15 derivatives thereof, or nucleic acid sequences complementary thereto. 

The polynucleotides disclosed herein, their complementary sequences, or 
fragments of either, can be used in assays to detect, amplify or quantify genes, 
nucleic acids, cDNAs or mRNAs relating to lung tissue disease and conditions 
associated therewith. They also can be used to identify an entire or partial coding 

20 region of an LUI03 polypeptide. They further can be provided in individual 

containers in the form of a kit for assays, or provided as individual compositions. If 
provided in a kit for assays, other suitable reagents such as buffers, conjugates and 
the like may be included. 

The polynucleotide may be in the form of RNA or DNA. Polynucleotides in 

25 the form of DNA, cDNA, genomic DNA, nucleic acid analogs and synthetic DNA 
are within the scope of the present invention. The DNA may be double-stranded or 
single-stranded, and if single stranded, may be the coding (sense) strand or non- 
coding (anti-sense) strand. The coding sequence which encodes the polypeptide 
may be identical to the coding sequence provided herein or may be a different coding 

30 sequence which coding sequence, as a result of the redundancy or degeneracy of the 
genetic code, encodes the same polypeptide as the DNA provided herein. 

This polynucleotide may include only the coding sequence for the 
polypeptide, or the coding sequence for the polypeptide and an additional coding 
sequence such as a leader or secretory sequence or a propi otein sequence, or the 

35 coding sequence for the polypeptide (and optionally an additional coding sequence) 
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and a non-coding sequence, such as a non-coding sequence 5" and/or 3' of the 
coding sequence for the polypeptide. 

In addition, the invention includes variant polynucleotides containing 
modifications s;tch as polynucleotide deletions, substitutions or additions; and any 
5 polypeptide modification resulting from the variant polynucleotide sequence. A 
polynucleotide of the present invention also may have a coding sequence which is a 
naturally occurring allelic variant of the coding sequence provided herein. 

In addition, the coding sequence for the polypeptide may be fused in the 
same reading frame to a polynucleotide sequence which aids in expression and 
1 0 secretion of a polypeptide from a host cell, for example, a leader sequence which 
functions as a secretory sequence for controlling transport of a polypeptide from the 
cell. The polypeptide having a leader sequence is a preprotcin and may have the 
leader sequence cleaved by the host cell to form the polypeptide. The 
polynucleotides may also encode for a proprotein which is the protein plus additional 
15 5' amino acid residues. A protein having a prosequence is a proprotein and may, in 
some cases, be an inactive form of the protein. Once the prosequence is cleaved an 
active protein remains. Thus, the polynucleotide of the present invention may 
encode for a protein, or for a protein having a prosequence, or for a protein having 
both a presequence (leader sequence) and a prosequence. 
20 The polynucleotides of the present invention may also have the coding 

sequence fused in frame to a marker sequence which allows for purification of the 
polypeptide of the present invention. The marker sequence may be a hexa-histidine 
tag supplied by a pQE-9 vector to provide for purification of the polypeptide fused to 
the marker in the case of a bacterial host, or, for example, the marker sequence may 
25 be a hemagglutinin (HA) tag when a mammalian host, e.g. a COS-7 cell line, is 
used. The HA tag corresponds to an epitope derived from the influenza 
hemagglutinin protein. See, for example, I. Wilson et al., Cell 37:767 (1984). 

It is contemplated that polynucleotides will be considered to hybridize to the 
sequences provided herein if there is at least 50%, preferably at least 70%, and more 
30 preferably at least 90% identity between the polynucleotide and the sequence. 

The present invention also provides an antibody produced by using a purified 
LU103 polypeptide of which at least a portion of the polypeptide is encoded by an 
LU103 polynucleotide selected from the polynucleotides provided herein. These 
antibodies may be used in the methods provided herein for the detection of LU103 
35 antigen in test samples. The presence of LU 103 antigen in the test samples is 
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indicative of the presence of a lung disc . condition. The antibody also may be 
used for therapeutic purposes, for exam *i neutralizing the activity of LU 1 03 
polypeptide in conditions associated \vu crcd or abnormal expression. 

The present invention further rei lo an LU103 polypeptide which has the 
5 deduced amino acid sequence as provic' rein, as well as fragments, analogs and 
derivatives of such polypeptide. The poi; 'ptidc of the present invention may be a 
recombinant polypeptide, a natural purif polypeptide or a synthetic polypeptide. 
The fragment, derivative or analog of the , U 103 polypeptide may be one in which 
one or more of the amino acid residues H tbstituted with a conserved or non- 
10 conserved amino acid residue (preferably i conserved amino acid residue) and such 
substituted amino acid residue may or may not be one encoded by the genetic code; 
or it may be one in which one or more of the amino acid residues includes a 
substitucnt group; or it may be one in which the polypeptide is fused with another 
compound, such as a compound to increase the half-life of the polypeptide (for 
15 example, polyethylene glycol); or it may one in which the additional amino acids 
are fused to the polypeptide, such as a le; t or secretory sequence or a sequence 
which is employed for purification of tlv ilypeptide or a proprotein sequence. 
Such fragments, derivatives and analogs a. e within the scope of the present 
invention. The polypeptides and polynucleotides of the present invention are 
20 provided preferably in an isolated form and preferably purified. 

Thus, a polypeptide of the present invention may have an amino acid 
sequence that is identical to that of the naairally occurring polypeptide or that is 
different by minor variations due to one or more amino acid substitutions. The 
variation may be a "conservative change" ■ ypically in the range of about 1 lo 5 amino 
25 acids, wherein the substituted amino acid ! us similar structural or chemical 

properties, e.g., replacement of leucine vrh tsoleucine or threonine with serine. In 
contrast, variations may include nonconsv vative changes, e.g., replacement of a 
glycine with a tryptophan. Similar minor variations may also include amino acid 
deletions or insertions, or both. Guidanc n determining which and how many 
30 amino acid residues may be substituted, i .zned or deleted without changing 

biological or immunological activity ma;. : found using computer programs well 
known in the art, for example, DNAST/- : software (DNASTAR Inc., Madison 
WI). 

Probes constructed according to polynucleotide sequences of the present 
35 invention can be used in various assay n: : hods to provide various types of analysis. 
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For example, such probes can be used in fluorescent . jtu hybridization (FISH) 
technology to perform chromosomal analysis, and us :.' *o identify cancer-specific 
structural alterations in the chromosomes, such as del.:., ons or translocations that are 
visible from chromosome spreads or detectable using \- 7R-generated and/or allele 
r specific oligonucleotides probes, allele specific ampli : uion or by direct 

sequencing. Probes also can be labeled with radioisotopes, directly- or indirectly- 
detectable haptens, or fluorescent molecules, and util; d for in situ hybridization 
studies to evaluate the mRNA expression of the gene comprising the polynucleotide 
in tissue specimens or cells. 
10 This invention aJso provides teachings as to the production of the 

polynucleotides and polypeptides provided herein. 
Probe Assays 

The sequences provided herein may be used to produce probes which can be 
used in assays for the detection of nucleic acids in test samples. The probes may be 

1 5 designed from conserved nucleotide regions of the polynucleotides of interest or 

from non-conserved nucleotide regions of the polynucleotide of interest. The design 
of such probes for optimization in assays is within tin: > kill of the routineer. 
Generally, nucleic acid probes are developed from non-conserved or unique regions 
when maximum specificity is desired, and nucleic acid probes are developed from 

20 conserved regions when assaying for nucleotide regions that arc closely related to, 
for example, different members of a multi-gene family or in related species like 
mouse and man. 

The polymerase chain reaction (PCR) is a technique for amplifying a desired 
nucleic acid sequence (target) contained in a nucleic acid or mixture thereof. In 

25 PCR, a pair of primers are employed in excess to hybridize to the complementary 
strands of the target nucleic acid. The primers are each extended by a polymerase 
using the target nucleic acid as a template. The extension products become target 
sequences themselves, following dissociation from the original target strand. New 
primers then are hybridized and extended by a polymcrxse, and the cycle is repeated 

30 to geometrically increase the number of target sequence molecules. PCR is disclosed 
in U.S. Patent Nos 4,683,195 and 4,683,202. 

The Ligase Chain Reaction (LCR) is an alternate method for nucleic acid 
amplification. In LCR, probe pairs are used which include two primary (first and 
second) and two secondary (third and fourth) probes, all of which are employed in 

35 molar excess to target. The first probe hybridizes to a first segment of the target 
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strand, and the s ; ~ond probe hybridizes to a second segment of the target stran .e 
first and second -egmcnts being contiguous so that the primary probes abut onr 
another in 5' ph sphate-3' hydroxy! relationship, and so thai a ligase can covak > y 
fuse or ligate th* two probes into a fused product. In addition, a third (seconda 
probe can hybri- :zc to a portion of the first probe and a fourth (secondary) pro! in 
hybridize to a portion of the second probe in a similar abutting fashion. Of cou * . if 
the target is initially double stranded, the secondary probes also will hybridize t 'ic 
target complement in the first instance. Once the ligated strand of primary prob ; is 
separated from the target strand, it will hybridize with the third and fourth probt 
which can be ligated to form a complementary, secondary ligated product. It is 
important to realize that the ligated products are functionally equivalent to either iic 
target or its complement. By repeated cycles of hybridization and ligation, 
amplification of the target sequence is achieved. This technique is described mc e 
completely in EP-A- 320 308, published June 1 6, 1989, and EP-A-439 1 82, 
published July 31, 1991. 

For amplification of mRNAs, it is within the scope of the present invent 
to reverse transcribe mRNA into cDNA followed by polymerase chain reaction IT- 
PCR); or, to use a single enzyme for both steps as described in U.S. Patent Nc 
5,322,770; or reverse transcribe mRNA into cDNA followed by asymmetric ga) 
ligase chain reaction (RT-AGLCR) as described by R.L. Marshall et ah, PCR 
Methods and Applications 4:80-84 (1994). 

Other known amplification methods which can be utilized herein include but 
are not limited to the so-called "NASBA" or "3SR" technique described by J.C 
Guatelli, et al., PNAS USA 87:1874-1878 (1990) and also described by J. 
Compton, Nature 350 (No. 6313):91-92 (1991); Q-beta amplification as descri: jd in 
published European Patent Application (EPA) No. 4544610; strand displaceme t 
amplification (as described in G.T. Walker et al. f Clin. Chem . 42:9-13 (1996)) md 
European Patent Application No. 684315; and target mediated amplification, as 
described in International Publication No. WO 93/22461. 

Detection of LU103 may be accomplished using any suitable detection 
method, includhg those detection methods which are currently well known in t ; 
art, as well as detection strategics which may evolve later. See, for example, C -.sxey 
et al., and U.S. Patent Nos 5,582,989 and 5,210,015. Examples of such detec -<m 
methods include target amplification methods as well as signal amplification 
technologies. An example of presently known detection methods would includ t'.ie 
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nucleic acid amplification technologies re . red to as PCR, LCR, NASBA, SDA, 
RCR and TMA. Sec. for example, U.S. :'cnt No. 5.582,989 to Caskey et a!., and 
U.S. Patent No. 5,210,015 Gelfand et al. -jtection may also be accomplished 
using signal amplification such as that di ! oscd in U.S. Patent No. 5,273,882 to 
5 Snitman et al.. While the amplification o. : rgel or signal is preferred at present, it is 
contemplated and within the scope of the ] csent invention that ultrasensitive 
detection methods which do not require ar r. !ification can be utilized herein. 

Detection, both amplified and non implificd, may be (combined) carried out 
using a variety of heterogeneous and horn -gencous detection formats. Examples of 

10 heterogeneous detection formats are diseased in U.S. Patent No. 5,273,882 to 
Snitman et al., EP-841 14441.9, U.S. Pat-nt No. 5,124,246 to Urdea et al., U.S. 
Patent No. 5.185,243 to Ullman et al. an 1 U.S. Patent No. 4,581,333 to Kourilsky 
et al.. Examples of homogeneous detection formats are disclosed in U.S. Patent 
No. 5,582,989 to Caskey et al., and U.S. Patent No. 5,210,015 to Gelfand et al. 

15 Also contemplated and within the scope o the present invention is the use of 
multiple probes in the hybridization assay which use improves sensitivity and 
amplification of the LU 103 signal. See, »r example, U.S. Patent Nos. 5,582,989 
and 5,210,015. 

In one embodiment, the present invention generally comprises the steps of 
20 contacting a test sample suspected of containing a target polynucleotide sequence 
with amplification reaction reagents comprising an amplification primer, and a 
detection probe that can hybridize with an internal region of the amplicon sequences. 
Probes and primers employed according to the method provided herein are labeled 
with capture and detection labels, whereir probes are labeled with one type of label 
25 and primers are labeled with another type M' label. Additionally, the primers and 
probes are selected such that the probe sec uence has a lower melt temperature than 
the primer sequences. The amplification i :agents, detection reagents and test sample 
are placed under amplification conditions whereby, in the presence of target 
sequence, copies of the target sequence (n amplicon) are produced. In the usual 
30 case, the amplicon is double stranded bee use primers are provided to amplify a 

target sequence and its complementary st; ind. The double stranded amplicon then is 
thermally denatured to produce single str; :dcd amplicon members. Upon formation 
of the single stranded amplicon members lie mixture is cooled to allow the 
formation of complexes between the prob s and single stranded amplicon members. 
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As the single stranded amplicon sequences and probe sequences are cooled, 
th nrobe sequences preferentially bind the single stranded amplicon members. This 
fi ling is counterintuitive given that the probe sequences generally arc selected to be 
si* oner than the primer sequences and therefore have a lower melt temperature than 
5 th : primers. Accordingly, the melt temperature of the amplicon produced by the 
primers should also have a higher melt temperature than the probes. Thus, as the 
mixture cools, the re-formation of the double stranded amplicon would be expected. 
As previously stated, however, this is not the case. The probes are found to 
preferentially bind the single stranded amplicon members. Moreover, this preference 

10 of probe/single stranded amplicon binding exists even when the primer sequences 
are added in excess of the probes. 

After the probe/single stranded amplicon member hybrids arc formed, they 
are detected. Standard heterogeneous assay formats arc suitable for detecting the 
hybrids using the detection labels and capture labels present on the primers and 

15 probes. The hybrids can be bound to a solid phase reagent by virtue of the capture 
label and detected by virtue of the detection label. In cases where the detection label 
is directly detectable, the presence of the hybrids on the solid phase can be detected 
by causing the label to produce a detectable signal, if necessary, and detecting the 
signal. In cases where the label is not directly detectable, the captured hybrids can 

20 be contacted with a conjugate, which generally comprises a binding member attached 
to a directly detectable label. The conjugate becomes bound to the complexes and the 
conjugates presence on the complexes can be detected with the directly detectable 
label. Thus, the presence of the hybrids on the solid phase reagent can be 
determined. Those skilled in the art will recognize that wash steps may be employed 

25 to wash away unhybridized amplicon or probe as well as unbound conjugate. 

Although the target sequence is described as single stranded, it also is 
contemplated to include the case where the target sequence is actually double 
stranded but is merely separated from its complement prior to hybridization with the 
amplification primer sequences. In the case where PCR is employed in this method, 

30 the ends of the target sequences are usually known. In cases where LCR or a 

modification thereof is employed in the preferred method, the entire target sequence 
is usually known. Typically, the target sequence is a nucleic acid sequence such as, 
for example, RNA or DNA. 

The method provided herein can be used in well-known amplification 

35 reactions that include thermal cycle reaction mixtures, particularly in PCR and gap 



WO 98/20143 



PCT/US97/20680 



LCR (GLCR). Amplification reactions typically employ primers to repeatedly 
generate copies of a target nucleic acid sequence, which target sequence is usually a 
small region of a much larger nucleic acid sequence. Primers are themselves nucleic 
acid sequences that are complementary to regions of a target sequence. Under 
5 amplification conditions, these primers hybridize or bind to the complementary 

regions of the target sequence. Copies of the target sequence typically are generated 
by the process of primer extension and/or ligation which utilizes enzymes with 
polymerase or ligase activity, separately or in combination, to add nucleotides to the 
hybridized primers and/or ligate adjacent probe pairs. The nucleotides that are added 

10 to the primers or probes, as monomers or preformed oligomers, are also 

complementary to the target sequence. Once the primers or probes have been 
sufficiently extended and/or ligated, they are separated from the target sequence, for 
example, by heating the reaction mixture to a "melt temperature" which is one in 
which complementary nucleic acid strands dissociate. Thus, a sequence 

1 5 complementary to the target sequence is formed. 

A new amplification cycle then can take place to further amplify the number 
of target sequences by separating any double stranded sequences, allowing primers 
or probes to hybridize to their respective targets, extending and/or Iigating the 
hybridized primers or probes and re-scparating. The complementary sequences that 

20 are generated by amplification cycles can serve as templates for primer extension or 
filling the gap of two probes to further amplify the number of target sequences. 
Typically, a reaction mixture is cycled between 20 and 100 times, more typically, a 
reaction mixture is cycled between 25 and 50 times. The numbers of cycles can be 
determined by the routineer. In this manner, multiple copies of the target sequence 

25 and its complementary sequence are produced. Thus, primers initiate amplification 
of the target sequence when it is present under amplification conditions. 

Generally, two primers which are complementary to a portion of a target 
strand and its complement are employed in PCR. For LCR, four probes, two of 
which are complementary to a target sequence and two of which are similarly 

30 complementary to the target's complement, are generally employed. In addition to 
the primer sets and enzymes previously mentioned, a nucleic acid amplification 
reaction mixture may also comprise other reagents which arc well known and include 
but are not limited to: enzyme cofactors such as manganese; magnesium; salts; 
nicotinamide adenine dinucleotide (NAD); and deoxynucleotide triphosphates 
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(dNTPs) such, as for example, deoxyadenine ; hate, dcoxyguanine 

triphosphate, deoxycytosine triphosphate and ( = ymine triphosphate. 

While the amplification primers initiate . ,;i .cation of the target sequence, 
the detection (or hybridization) probe is not inv :d *i amplification. Detection 
5 probes are generally nucleic acid sequences or : !cd nucleic acid analogs such 
as, for example, peptide nucleic acids which an t . c sed in International 
Publication No. WO 92/20702: morpholino an ! \; vhich arc described in U.S. 
Patents Nos 5,185,444, 5,034,506 and 5 f 142,CI a k1 the like. Depending upon 
the type of label carried by the probe, the probe i:: .-ir.ployed to capture or detect the 

10 amplicon generated by the amplification reaction The probe is not involved in 
amplification of the target sequence and therefore may have to be rendered "non- 
extendible" in that additional dNTPs cannot be ad Jed to the probe. In and of 
themselves analogs usually are non-extendible and nucleic acid probes can be 
rendered non-extendible by modifying the 3* erd < >f the probe such that the hydroxyl 

15 group is no longer capable of participating in cU> ^aMon. For example, the 3' end of 
the probe can be functionalized with the capture- *. i Jc lection label to thereby 
consume or otherwise block the hydroxyl grou \ \\ amatively, the 3' hydroxyl 
group simply can be cleaved, replaced or modilicd. U.S. Patent Application Serial 
No. 07/049,061 filed April 19, 1993-describes modifications which can be used to 

20 render a probe non-extendible. 

The ratio of primers to probes is not important. Thus, either the probes or 
primers can be added to the reaction mixture in excess whereby the concentration of 
one would be greater than the concentration of the other. Alternatively, primers and 
probes can be employed in equivalent concentration:*.. Preferably, however, the 

25 primers are added to the reaction mixture in excess cf the probes. Thus, primer to 
probe ratios of, for example, 5:1 and 20: 1 are pn.*:en*ed. 

While the length of the primers and probes can vary, the probe sequences are 
selected such that they have a lower melt temperature than the primer sequences. 
Hence, the primer sequences are generally lonvc ±i\n the probe sequences. 

30 Typically, the primer sequences are in the rang: •:■ ;' between 20 and 50 nucleotides 
long, more typically in the range of between 2( . : J '0 nucleotides long. The typical 
probe is in the range of between 10 and 25 nuc:j 'aczs long. 

Various methods for synthesizing prim r , ami probes are well known in the 
art. Similarly, methods for attaching labels to ; ne:?; or probes are also well known 

35 in the art. For example, it is a matter of routine to synthesize desired nucleic acid 
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p. ;i:iers or probes using conventional nucleotide phosphoramiditc chemistry and 
i ;trumcnts available from Applied Biosystems, Inc., (Foster City, CA), DuPont 
( ; vViiinington, DE), or Milligen (Bedford MA). Many methods have been described 
for labeling oligonucleotides such as the primers or probes of the present invention. 
I .iv/.o Biochemical (New York, NY) and Clontech (Palo Alto, CA) both have 
described and commercialized probe labeling techniques. For example, a primary 
amine can be attached to a 3' oligo terminus using 3-Amine-ON CPG™ (Clontech, 
Palo Alto, CA). Similarly, a primary amine can be attached to a 5* oligo terminus 
using Aminomodifier II 0 (Clontech). The amines can be reacted to various haptens 
1 0 using conventional activation and linking chemistries. In addition, copending 
applications U.S. Serial Nos. 625,566, filed December 1 1, 1990, and 630,908, 
filed December 20, 1990, teach methods for labeling probes at their 5' and 3* 
termini, respectively. International Publication Nos WO 92/10505, published 25 
June 1992, and WO 92/1 1388, published 9 July 1992, teach methods for labeling 
1 5 probes at their 5' and 3* ends, respectively. According to one known method for 
labeling an oligonucleotide, a label-phosphoramidite reagent is prepared and used to 
add the label to the oligonucleotide during its synthesis. See, for example, NT. 
Thuong et ah, Tel Letters 29(46):5905-5908 (1988); or J.S. Cohen et ah, published 
U.S. Patent Application 07/246,688 (NTIS ORDER No. PAT-APPL-7-246,688) 
20 (1989). Preferably, probes are labeled at their 3' and 5' ends. 

A capture label is attached to the primers or probes and can be a specific 
binding member which forms a binding pair with the solid phase reagent's specific 
binding member. It will be understood that the primer or probe itself may serve as 
the capture label. For example, in the case where a solid phase reagent's binding 
25 member is a nucleic acid sequence, it may be selected such that it binds a 

complementary portion of the primer or probe to thereby immobilize the primer or 
probe to the solid phase. In cases where the probe itself serves as the binding 
member, those skilled in the an will recognize that the probe will contain a sequence 
or "tail" that is not complementary to the single stranded amplicon members. In the 
30 case where the primer itself serves as the capture label, at least a portion of the 
primer will be free to hybridize with a nucleic acid on a solid phase because the 
probe is selected such that it is not fully complementary to the primer sequence. 

Generally, probe/single stranded amplicon member complexes can be 
detected using techniques commonly employed to perform heterogeneous 
35 immunoassays. Preferably, in this embodiment, detection is performed according to 
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the protocols used by the commercially available Abbott LCx e instrumentatioi 
(Abbott Laboratories, Abbott Park, IL). 

The primers and probes disclosed herein are useful in typical PCR assa /s, 
wherein the test sample is contacted with a pair of primers, amplification is 

5 performed, the hybridization probe is added, and detection is performed. 

Another method provided by the present invention comprises contacting a 
test sample with a plurality of polynucleotides, wherein at least one polynudeoi :de is 
an LU103 molecule as described herein, hybridizing the test sample with the 
plurality of polynucleotides and detecting hybridization complexes. Hybridization 

1 0 complexes are identified and quantitated to compile a profile which is indicative of 
lung tissue disease, such as lung cancer. Expressed RNA sequences may further be 
detected by reverse transcription and amplification of the DNA product by 
procedures well-known in the art, including polymerase chain reaction (PCR). 
Dnip Screening and Gene Therapy . 

15 The present invention also encompasses the use of gene therapy methods for 

the introduction of anti-sense LU103 derived molecules, such as polynucleotid e or 
oligonucleotides of the present invention, into patients with conditions associated 
with abnormal expression of polynucleotides related to a lung tissue disease or 
condition, especially lung cancer. These molecules, including antisense RNA and 

20 DNA fragments and ribozymes, are designed to inhibit the translation of LU103- 
mRNA, and may be used therapeutically in the treatment of conditions associated 
with altered or abnormal expression of an LU103 polynucleotide. 

Alternatively, the oligonucleotides described above can be delivered to cells 
by procedures known in the an such that the anti-sense RNA or DNA may be 

25 expressed in vivo to inhibit production of an LU 103 polypeptide in the manner 
described above. Antisense constructs to an LU 103 polynucleotide, therefore, 
reverse the action of LU 103 transcripts and may be used for treating lung tissue 
disease conditions, such as lung cancer. These antisense constructs may also be 
used to treat tumor metastases. 

30 The present invention also provides a method of screening a plurality of 

compounds for specific binding to LU103 polypeptide(s), or any fragment thereof, 
to identify at least one compound which specifically binds the LU103 polypeptide. 
Such a method comprises the steps of providing at least one compound; combining 
the LU103 polypeptide with each compound under suitable conditions for a time 
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sufficicnt to allow binding; and detecting the LU103 poI> peptide binding to each 
compound. 

The polypeptide or peptide fragment employed in such a test may either be 
free in solution, affixed to a solid support, borne on a eel! surface or located 
5 intracellular^. One method of screening utilises eukaryciic or prokaryotic host cells 
which are stably transfected with recombinant nucleic acids which can express the 
polypeptide or peptide fragment. A drug, compound, or any other agent, may be 
screened against such transfected cells in competitive binding assays. For example, 
the formation of complexes between a polypeptide and the agent being tested can be 
10 measured in either viable or fixed cells. 

The present invention thus provides methods of screening for drugs, 
compounds, or any other agent which can be used to treat diseases associated with 
LU103. These methods comprise contacting the agent with a polypeptide or 
fragment thereof and assaying for either the presence of a complex between the agent 
1 5 and the polypeptide, or for the presence of a complex between the polypeptide and 
the cell. In competitive binding assays, the polypeptide typically is labeled. After 
suitable incubation, free (or uncomplexed) polypeptide or fragment thereof is 
separated from that present in bound form, and the amount of free or uncomplexed 
label is used as a measure of the ability of the particular agent to bind to the 
20 polypeptide or to interfere with the polypeptide/cell complex. 

The present invention also encompasses the use of competitive screening 
assays in which neutralizing antibodies capable of binding polypeptide specifically 
compete with a test agent for binding to the polypeptide or fragment thereof. In this 
manner, the antibodies can be used to detect the presence of any polypeptide in the 
25 test sample which shares one or more antigenic determinants with an LU103 
polypeptide as provided herein. 

Another technique for screening provides high throughput screening for 
compounds having suitable binding affinity to at least one polypeptide of LU103 
disclosed herein. Briefly, large numbers of different small peptide test compounds 
30 are synthesized on a solid phase, such as plastic pins or some other surface. The 
peptide test compounds are reacted with polypeptide and washed. Polypeptide thus 
bound to the solid phase is detected by methods well-known in the art. Purified 
polypeptide can also be coated directly onto plates for use in the screening techniques 
described herein. In addition, non-neutralizing antibodies can be used to capture the 
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pol cptidc ai C immobilize it on the solid support. See, for example, EP 84/03564, 
pul ; ;hed on September 13, 1984. 

The gcai of rational drug design is to produce structural analogs of 
bio rzically active polypeptides of interest or of the small molecules including 
5 age : r;ts, anta< cnists, or inhibitors with which they interact. Such structural analogs 
can )c used to design drugs which are more active or stable forms of the polypeptide 
or v iiich enhance or interfere with the function of a polypeptide in vivo. J. 
Hodgson, Biotechnology 9:19-21 (1991). 



1 0 polypeptide, or of a polypeptidc-inhibitor complex, is determined by x-ray 

crystallography, by computer modeling or, most typically, by a combination of the 
two approaches. Both the shape and charges of the polypeptide must be ascertained 
to elucidate the structure and to determine active site(s) of the molecule. Less often, 
usei 'ul information regarding the structure of a polypeptide may be gained by 

15 modeling based on the structure of homologous proteins. In both cases, relevant 
structural information is used to design analogous polypeptide-like molecules or to 
identify efficient inhibitors 

Useful examples of rational drug design may include molecules which have 
improved activity or stability as shown by S. Braxton et al., Biochemistry 3 1:7796- 

20 780 1 ( 1 992), or which act as inhibitors, agonists, or antagonists of native peptides 
as shown by S.B.P. Athauda et al. f J. Biochem. (Tokvo^ 1 13 (6):742-746 (1993). 

It also is possible to isolate a target-specific antibody selected by an assay as 
described hereinabove, and then to determine its crystal structure. In principle, this 
approach yields a pharmacophore upon which subsequent drug design can be based. 

25 It further is possible to bypass protein crystallography altogether by generating anti- 
idiotypic antibodies ("anti-ids") to a functional, pharmacologically active antibody. 
As a mirror image of a mirror image, the binding site of the anti-id is an analog of the 
original receptor. The anti-id then can be used to identify and isolate peptides from 
banks of chemically or biologically produced peptides. The isolated peptides then 

30 can act as the pharmacophore (that is, a prototype pharmaceutical drug). 

A sufficient amount of a recombinant polypeptide of the present invention 
ma\ be made available to perform analytical studies such as X-ray crystallography. 
In a Idition, knowledge of the polypeptide amino acid sequence which is derivable 
fro: i the nucleic acid sequence provided herein will provide guidance to those 



For example, in one approach, the three-dimensional structure of a 
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employing computer modeling techniques in place of, or in addition to, x-ray 
crystallography. 

Antibodies specific to an LU103 polypeptide (e.g., anti-LU103 antibodies) 
further may be used to inhibit the biological action of the polypeptide by binding to 

5 the polypeptide. In this manner, the antibodies may be used in therapy, for example, 
to treat lung tissue diseases, including lung cancer and its metastases. 

Further, such antibodies can detect the presence or absence of the LU103 
polypeptide in a test sample and, therefore, are useful as diagnostic markers for the 
diagnosis of a lung tissue disease or condition, especially lung cancer. Such 

10 antibodies may also function as a diagnostic marker for lung tissue disease 

conditions, such as lung cancer. The present invention also is directed to antagonists 
and inhibitors of the polypeptides of the present invention. The antagonists and 
inhibitors arc those which inhibit or eliminate the function of the polypeptide. Thus, 
for example, an antagonist may bind to a polypeptide of the present invention and 

15 inhibit or eliminate its function. The antagonist, for example, could be an antibody 
against the polypeptide which eliminates the activity of the LU103 polypeptide by 
binding the LU103 polypeptide, or in some cases the antagonist may be an 
oligonucleotide. Examples of small molecule inhibitors include, but are not limited 
to, small peptides or peptide-like molecules. 

20 The antagonists and inhibitors may be employed as a composition with a 

pharmaceutical^ acceptable carrier, including, but not limited to, saline, buffered 
saline, dextrose, water, glycerol, ethanol and combinations thereof. Administration 
of LU103 polypeptide inhibitors is preferably systemic. The present invention also 
provides an antibody which inhibits the action of such a polypeptide. 

25 Antisense technology can be used to reduce gene expression through triple- 

helix formation or antisense DN A or RNA, both of which methods are based on 
binding of a polynucleotide to DN A or RNA. For example, the 5* coding portion of 
the polynucleotide sequence, which encodes for the polypeptide of the present 
invention, is used to design an antisense RNA oligonucleotide of from 10 to 40 base 

30 pairs in length. A DNA oligonucleotide is designed to be complementary to a region 
of the gene involved in transcription, thereby preventing transcription and the 
production of the LU103 polypeptide. For triple helix, see, for example, Lee et al, 
Nuc. Acids Res . 6:3073 (1979): Cooney et al, Science 241 :456 (1988); and Dervan 
et al, Science 25 1: 1360 ( 1991) The antisense RNA oligonucleotide hybridizes to the 

35 mRNA in vivo and blocks translation of a mRNA molecule into the LU103 
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polypeptidc. For antisense, see, for example, ( 
(1991); and "Oligodeoxynucleotides as Antisen 
CRC Press, Boca Raton, Fla. (1988). Antisens: 
efficacy when modified to contain artificial intei 
5 molecule resistant to nucleolytic cleavage. Sucl 
include, but arc not limited to, mcthylphosphon 
phosphoroamydate internucleotide linkages. 
Recombinant Technology . 
The present invention provides host eel! 

10 LU103 polynucleotides of the present invention 
the polypeptide(s) they encode. Such methods c 
under conditions suitable for the expression of t' 
recovering the LU103 polypeptide from the cell 
The present invention also provides vect 

15 polynucleotides of the present invention, host ce 
with vectors of the present invention and the pr( 
present invention by recombinant techniques. 

Host cells are genetically engineered (tr: 
with the vectors of this invention which may be 

20 vectors. The vector may be in the form of a plas 
The engineered host cells can be cultured in com 
appropriate for activating promoters, selecting tr; 
LU103 gene(s). The culture conditions, such a. 1 
those previously used with the host cell selectee 

25 to the ordinarily skilled artisan. 

The polynucleotides of the present invcr 
a polypeptide by recombinant techniques. Thus 
be included in any one of a variety of expressior 
plasmids for expressing a polypeptide. Such ve 

30 nonchromosomal and synthetic DNA sequence; 
plasmids; phage DNA; yeast plasmids; vectors c 
plasmids and phage DNA, viral DNA such as v 
and pseudorabies. However, any other plasmid 
is replicable and viable in the host. 



; , J. Neurochem . 56:560 
libitors of Gene Expression," 
^nucleotides act with greater 
!: otide linkages which render the 
r cial internucleotide linkages 
phosphorothiolate and 
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• appropriate DNA sequence may be inserted into the vector by a variety 
of pro res. In general, the DNA sequence is inserted into appropriate restriction 
endon asc sites by procedures known in the art. Such procedures and others are 
deemc ' be within the scope of those skilled in the art. The DNA sequence in the 
5 expre: vector is operatively. linked to an appropriate expression control 

seque, ; :. ;) (promoter) to direct mRNA synthesis. Representative examples of such 
promc include, but are not limited to, the LTR or the SV40 promoter, the coH 
lac or .rp. the phage lambda P sub L promoter and other promoters known to control 
expression of genes in prokaryotic or eukaryotic cells or their viruses. The 

10 expression vector also contains a ribosomc binding site for translation initiation and a 
transciipiion terminator. The vector may also include appropriate sequences for 
amplifying expression. In addition, the expression vectors preferably contain a gene 
to provide a phenotypic trait for selection of transfected host cells such as 
dihydi ofolatc reductase or neomycin resistance for eukaryotic cell culture, or such as 

1 5 tetraeve ' ine or ampicillin resistance in R coli . 

The vector containing the appropriate DNA sequence as hereinabove 
descri' 1 J. as well as an appropriate promoter or control sequence, may be employed 
to transfect an appropriate host to permit the host to express the protein. As 
representative examples of appropriate hosts, there may be mentioned: bacterial cells, 

20 such as jL coli . Salmonella typhimurium : Strcptomyces sp.: fungal cells, such as 
yeast; insect cells such as Drosophila and Sf9; animal cells such as CHO, COS or 
Bowes melanoma; plant cells, etc. The selection of an appropriate host is deemed to 
be within the scope of those skilled in the art from the teachings provided herein. 

More particularly, the present invention also includes recombinant constructs 

25 comprising one or more of the sequences as broadly described above. The 
constructs comprise a vector, such as a plasmid or viral vector, into which a 
sequence of the invention has been inserted, in a forward or reverse orientation. In a 
preferred aspect of this embodiment, the construct further comprises regulatory 
sequer.cjs, including, for example, a promoter, operably linked to the sequence. 

30 Large numbers of suitable vectors and promoters arc known to those of skill in the 
art and ::rc commercially available. The following vectors are provided by way of 
exam]-.]. 1 . Bacterial: pINCY (Incyte Pharmaceuticals Inc., Palo Alto, CA), 
pSP0*71 (Life Technologies, Gaithersburg, MD), pQE70, pQE60, pQE-9 (Qiagen) 
pBs, i prescript, psiX174, pBluescript SK, pBsKS, pNH8a, pNH16a, pNH18a, 

35 pNH46a (Slralagene); pTrc99A, pKK223-3, pKK233-3, pDR540, pRIT5 
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(Pharniacia); Eukaryotic: pWLneo, pSV2cai T pOG44, pXTl, pSG (Straiagene) 
pSVK3, pBPV, pMSG, pSVL (Pharmacia). However, any other plasmid or vector 
may be used as long as it is replicable and viable in the host. 

Plasmid pINCY is generally identical to the plasmid pSPORTl (available 
5 from Life Technologies. Gaithersburg, MD) with the exception that it has two 

modifications in the polylinker (multiple cloning site). These modifications are (1) it 
lacks a Hindm restriction site and (2) its EcoRI restriction site lies at a different 
location. pINCY is created from pSPORTl by cleaving pSPORTl with both 
Hindm and EcoRI and replacing the excised fragment of the polylinker with 

10 synthetic DNA fragments (SEQUENCE ID NO 6 and SEQUENCE ID NO 7). This 
replacement may be made in any manner known to those of ordinary skill in the art. 
For example, the two nucleotide sequences, SEQUENCE ID NO 6 and SEQUENCE 
ID NO 7, may be generated synthetically with 5' terminal phosphates, mixed 
together, and then ligated under standard conditions for performing staggered end 

15 ligations into the pSPORTl plasmid cut with Hindin and EcoRI. Suitable host cells 
(such as E, coli DH5~ cells) then are transfected with the ligated DNA and 
recombinant clones are selected for ampicillin resistance. Plasmid DNA then is 
prepared from individual clones and subjected to restriction enzyme analysis or DNA 
sequencing in order to confirm the presence of insert sequences in the proper 

20 orientation. Other cloning strategies known to the ordinary artisan also may be 
employed. 

Promoter regions can be selected from any desired gene using CAT 
(chloramphenicol transferase) vectors or other vectors with selectable markers. Two 
appropriate vectors are pKK232-8 and pCM7. Particular named bacterial promoters 

25 include lad, lacZ, T3, SP6, T7, gpt, lambda P sub R, P sub L and tip. Eukaryotic 
promoters include cytomegalovirus (CM V) immediate early, herpes simplex virus 
(HSV) thymidine kinase, early and late SV40, LTRs from retroviruses and mouse 
metallothionein-I. Selection of the appropriate vector and promoter is well within the 
level of ordinary skill in the art. 

30 In a further embodiment, the present invention provides host cells containing 

the above-described construct. The host cell can be a higher eukaryotic cell, such as 
a mammalian cell, or a lower eukaryotic cell, such as a yeast cell, or the host cell can 
be a prokaryotic cell, such as a bacterial cell. Introduction of the construct into the 
host cell can be effected by calcium phosphate transfection, DEAE-Dextran mediated 

35 transfection, or clectroporation (L. Davis el al. f "Basic Methods in Molecular 
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Biology," 2nd edition, Applcton and Lang, Pare Publishing, East Norwalk, 
CT(1994)). 

The constructs in host cells can be used i ivcntional manner to produce 
the gene product encoded by the recombinant sc Alternatively, the 

5 polypeptides of the invention can be syntheticall; iced by conventional peptide 
synthesizers. 

Recombinant proteins can be expressed i ? lmalian cells, yeast, bacteria, 
or other cells, under the control of appropriate pr ; ;?rs. Cell-free translation 
systems can also be employed to produce such p: ; .is using RNAs derived from 

10 the DNA constructs of the present invention. Ap: r ^riate cloning and expression 
vectors for use with prokaryotic and eukaryotic hos s arc described by Sambrook et 
al., Molecular Cloning: A Laboratory Manual . Se c id Edition, (Cold Spring 
Harbor, N.Y., 1989). 

Transcription of a DNA encoding the pol : lide(s) of the present invention 

15 by higher eukaryotes is increased by inserting an :ncer sequence into the vector. 
Enhancers are cis-acting elements of DNA, usua "out from 10 to 300 bp, that act 
on a promoter to increase its transcription. Exan include the SV40 enhancer on 
the late side of the replication origin (bp 100 to 27) . a cytomegalovirus early 
promoter enhancer, a polyoma enhancer on the hi: ;idc of the replication origin and 

20 adenovirus enhancers. 

Generally, recombinant expression vectors will include origins of replication 
and selectable markers permitting transfection of the host cell, e.g., the ampicillin 
resistance gene of IL Mi and S . cerevisiac TRP1 gene, and a promoter derived from 
a highly-expressed gene to direct transcription of lownstream structural sequence. 

25 Such promoters can be derived from operons enc; c ng glycolytic enzymes such as 
3-phosphoglycerate kinase (PGK), alpha factor, ■ : .1 phosphatase, or heat shock 
proteins, among others. The heterologous struct;!;;.! sequence is assembled in 
appropriate phase with translation initiation and termination sequences, and 
preferably, a leader sequence capable of directing / :retion of translated protein into 

30 the periplasmic space or extracellular medium. C ,nally, the heterologous 

sequence can encode a fusion protein including r. : ■ crminal identification peptide 
imparting desired characteristics, e.g., stabilizati. r simplified purification of 
expressed recombinant product. 

Useful expression vectors for bacterial u . constructed by inserting a 

35 structural DNA sequence encoding a desired pro' ;■ together with suitable translation 
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initiat; termination signals in operable reading phase with a functional 
promc >.e vector will comprise one or more phenotypic selectable markers and 
an orir. replication to ensure maintenance of the vector and to, if desirable, 
provid . ■ lification within the host. Suitable prokaryotic hosts for transfection 
5 inciud ^j, Bacillus subtilis . Salmonella tvphimurium and various species within 
the gen s 'seudomonas . Strcptomvces and Staphylococcus , although, others may 
also be vloyed as a routine matter of choice. 

/seful expression vectors for bacterial use comprise a selectable marker and 
bactcri: origin of replication derived from plasmids comprising genetic elements of 

10 the well known cloning vector pBR322 (ATCC 37017). Other vectors include but 
are not limited to PKK223-3 (Pharmacia Fine .Chemicals, Uppsala, Sweden) and 
GEM I f Promega Biotec, Madison, WI). These pBR322 "backbone" sections are 
combined with an appropriate promoter and the structural sequence to be expressed. 
Following transfection of a suitable host and growth of the host to an 

15 approp* is : cell density, the selected promoter is derepressed by appropriate means 
(e.g., teinj-crature shift or chemical induction), and cells are cultured for an 
additio; :i! period. Cells are typically harvested by centrifugation, disrupted by 
physical or chemical means, and the resulting crude extract retained for further 
purification. Microbial cells employed in expression of proteins can be disrupted by 

20 any convenient method, including freeze-thaw cycling, sonication, mechanical 

disruption, or use of cell lysing agents; such methods are well-known to the ordinary 
artisan. 

Various mammalian cell culture systems can also be employed to express 
recombinant protein. Examples of mammalian expression systems include the COS- 

25 7 lines of monkey kidney fibroblasts described by Gluzman, CeU 23:175 (1981), 
and other cell lines capable of expressing a compatible vector, such as the C127, 
HEK-293, 3T3, CHO, HeLa and BHK cell lines. Mammalian expression vectors 
will comprise an origin of replication, a suitable promoter and enhancer and also any 
necessn y ribosome binding sites, polyadenylation sites, splice donor and acceptor 

30 sites, transcriptional termination sequences and 5' flanking nontranscribed 

sequen . DNA sequences derived from the SV40 viral genome, for example, 
SV40 o:i£:n, early promoter, enhancer, splice, and polyadenylation sites may be 
used to :; ovide the required nontranscribed genetic elements. Representative, useful 
vector: • iude pRC/CMV and pcDNA3 (available from Invitrogen, San Diego, 

35 CA). 
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LU103 polypeptides arc recovered and purified from recombinant cell 
cultures by known methods including affinity chromatography, ammonium sulfate 
or ethanol precipitation, acid extraction, anion or cation exchange chromatography, 
phosphocellulose chromatography, hydrophobic interaction chromatography, 
5 hydroxyapatitc chromatography or lectin chromatography. It is preferred to have 
low concentrations (approximately 0.1-5 mM) of calcium ion present during 
purification (Price, et al., J. Biol. Chem . 244:917 (1969)). Protein refolding steps 
can be used, as necessary, in completing configuration of the polypeptide. Finally, 
high performance liquid chromatography (HPLC) can be employed for final 

10 purification steps. 

Thus, polypeptides of the present invention may be naturally purified 
products expressed from a high expressing cell line, or a product of chemical 
synthetic procedures, or produced by recombinant techniques from a prokaryotic or 
eukaryotic host (for example, by bacterial, yeast, higher plant, insect and mammalian 

15 cells in culture). Depending upon the host employed in a recombinant production 
procedure, the polypeptides of the present invention may be glycosylated with 
mammalian or other eukaryotic carbohydrates or may be non-glycosylated. The 
polypeptides of the invention may also include an initial methionine amino acid 
residue. 

20 The starting plasmids can be constructed from available plasmids in accord 

with published, known procedures. In addition, equivalent plasmids to those 
described are known in the art and will be apparent to the ordinarily skilled artisan. 

The following is the general procedure for the isolation and analysis of 
cDNA clones. In a particular embodiment disclosed herein, mRNA was isolated 

25 from lung tissue and used to generate the cDNA library. Lung tissue was obtained 
from patients by surgical resection and was classified as tumor or non-tumor tissue 
by a pathologist. 

The cDNA inserts from random isolates of the lung tissue libraries were 
sequenced in part, analyzed in detail as set forth in the Examples and are disclosed in 

30 the Sequence Listing as SEQUENCE ID NO 1, SEQUENCE ID NO 2, 

SEQUENCE ID NO 3. The consensus sequence of these inserts is presented as 
SEQUENCE ID NO 4. These polynucleotides may contain an entire open reading 
frame with or without associated regulatory sequences for a particular gene, or they 
may encode only a portion of the gene of interest. This is attributed to the fact that 

35 many genes are several hundred and sometimes several thousand bases in length 
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The '-id sequences disclosed herein may be joined to a variety c 

other polyni ; sequences and vectors of interest by means of well-establi : i 
recombinan . jchniqucs. Sec J. Sambrook et ah, supra . Vectors of inten 
include clor : : ors, such as plasmids, cosmids, phage derivatives, phagen 

5 as well as sc g, replication and expression vectors, and the like. In gent 

such vectors n an origin of replication functional in at least one organism, 
convenient k r^n endonuclea.se digestion sites and selectable markers 
appropriate f. articular host cells. The vectors can be transferred by a variety .>/ 
means known *hose of skill in the art into suitable host cells which then prod ce 

10 the desired DN V RNA or polypeptides. 

Occasionally, sequencing or random reverse transcription errors will musk 
the presence of the appropriate open reading frame or regulatory element. In su ;h 
cases, it is possible to determine the correct reading frame by attempting to expr ess 
the polypcpiic^ and determining the amino acid sequence by standard peptide 

15 mapping ano Slicing techniques. See, F.M. Ausubel et al., Cunrent Protoc A tin 
Molecular BtH ;iv. John Wiley & Sons, New York, NY (1989). Additionally the 
actual readin.: -viic of a given nucleotide sequence may be determined by 
transfection of host cells with vectors containing all three potential reading franr es. 
Only those cells with the nucleotide sequence in the correct reading frame will 

20 produce a peptide of the predicted length. 

The nucleotide sequences provided herein have been prepared by current, 
state-of-the-art. automated methods and as such may contain unidentified 
nucleotides. These will not present a problem to those skilled in the art who wish to 
practice the invention. Several methods employing standard recombinant 

25 techniques, dj::.:ibed in J. Sambrook ( supra ) or periodic updates thereof, may ho 
used to compL c the missing sequence information. The same techniques used tor 
obtaining a ft:!; Isngth sequence, as described herein, may be used to obtain 
nucleotide sequences. 

Exprc : of a particular cDNA may be accomplished by subcloning tl : 

30 cDNA into ur. :;.;)ropriate expression vector and transfecting this vector into an 

appropriate e .:;v/,ion host. The cloning vector used for the generation of the :;ng 
tissue cDNA : : nvry can be used for transcribing mRNA of a particular cDNA i ;d 
contains a pn , *.*cr for bcta-galactosidase, an amino-tcrminal met and the 
subsequent s. ..: ::mino acid residues of beta-galactosidase. Immediately folic v:ng 

35 these eight residues is an engineered bacteriophage promoter, useful for artificial 
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priming and transcription, as well as a number of unique restriction sites, including 
EcoRI. for cloning. The vector can be transfected into an appropriate host strain of 
E. coli. 

Induction of the isolated bacterial strain with isopropylthiogalactoside (1PTG) 
5 using standard methods will produce a fusion protein which contains the first seven 
residues of beta-galactosidase, about 15 residues of linker and the peptide encoded 
within the cDNA. Since cDNA clone inserts are generated by an essentially random 
process, there is one chance in three that the included cDNA will lie in the correct 
frame for proper translation. If the cDNA is not in the proper reading frame, the 
10 correct frame can be obtained by deletion or insertion of an appropriate number of 
bases by well known methods including in vitro mutagenesis, digestion with 
exonuclcasc III or mung bean nuclease, or oligonucleotide linker inclusion. 

The cDNA can be shuttled into other vectors known to be useful for 
expression of protein in specific hosts. Oligonucleotide primers, containing cloning 
1 5 sites and segments of DNA sufficient to hybridize to stretches at both ends of the 
target cDNA, can be synthesized chemically by standard methods. These primers 
can then be used to amplify the desired gene segments by PCR. The resulting new 
gene segments can be digested with appropriate restriction enzymes under standard 
conditions and isolated by gel electrophoresis. Alternately, similar gene segments 
20 can be produced by digestion of the cDNA with appropriate restriction enzymes and 
filling in the missing gene segments with chemically synthesized oligonucleotides. 
Segments of the coding sequence from more than one gene can be ligated together 
and cloned in appropriate vectors to optimize expression of recombinant sequence. 
Suitable expression hosts for such chimeric molecules include but are not 
25 limited to, mammalian cells such as Chinese Hamster Ovary (CHO) and human 

embryonic kidney (HEK) 293 cells, insect cells such as Sf9 cells, yeast cells such as 
Saccharomvees cerevisiae and bacteria such as E. coli . For each of these cell 
systems, a useful expression vector may also include an origin of replication to allow 
propagation in bacteria and a selectable marker such as the beta-lactamase antibiotic 
30 resistance gene to allow selection in bacteria. In addition, the vectors may include a 
second selectable marker, such as the neomycin phosphotransferase gene, to allow 
selection in transfected eukaryotic host cells. Vectors for use in eukaryotic 
expression hosts may require the addition of 3' poly A tail if the sequence of interest 
lacks poly A. 
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the conditioned medium and analyzed using chromatogi a 
10 in the art. An alternative method for the production of hi 

protein involves the transfection of mammalian embryos . 

recombinant protein from milk produced by transgenic c 

Polypeptides and closely related molecules may be expre: ;cd recombinantly in such 

a way as to facilitate protein purification. One approacl i ol ves expression of a 
1 5 chimeric protein which includes one or more additional \ 

naturally present on human polypeptides. Such purifica i 

include, but are not limited to, metal-chciating peptides ;i 

domains that allow purification on immobilized metals, p; 

purification on immobilized immunoglobulin and the domain utilized in the FLAGS 
20 extension/affinity purification system (Immuncx Corp, Seattle, WA). The inclusion 

of a clcavable linker sequence such as Factor XA or enterokinase from Invitrogen 

(San Diego, CA) between the polypeptide sequence and the purification domain may 

be useful for recovering the polypeptide. 
Immunoassays 

25 LU103 polypeptides, including fragments, deriv; ;vcs, and analogs thereof, 

or cells expressing such polypeptides, can be utilized in ; ■ aricty of assays, many of 
which are described herein, for the detection of antibodic . ;o lung tissue. They also 
can be used as immunogens to produce antibodies. Thes: antibodies can be, for 
example, polyclonal or monoclonal antibodies, chimeric ::rr!e chain and humanized 

30 antibodies, as well as Fab fragments, or the product of i; 
Various procedures known in the art may be used for tl 
antibodies and fragments. 

For example, antibodies generated against a pol .■; 
sequence of the present invention can be obtained by di. . a 

35 polypeptide into an animal or by administering the polypi >.idc to an animal such as a 
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mouse, . 



■>at or human. A mouse, rabbit or goat is preferred. The 
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. icctcd from the group consisting of SEQUENCE ID NOS 14-19, 
hereof. The antibody so obtained then will bind the polypeptide 



:\»iner, even a sequence encoding only a fragment of the polypeptide 
:ncratc antibodies that bind the native polypeptide. Such antibodies 



5 



can be t ~ 



then can -d to isolate the polypeptide from test samples such as tissue suspected 
of contaii i -::t polypeptide. For preparation of monoclonal antibodies, any 
technique : irh provides antibodies produced by continuous cell line cultures can be 
used. Ex .pkrs include the hybridoma technique as described by Kohler and 

10 Milstein, Naiure 256:495-497 (1975), the trioma technique, the human B-cell 

hybridoma technique as described by Kozbor et al, Immun. Today 4:72 (1983) and 
the EB V-hybridoma technique to produce human monoclonal antibodies as described 
by Cole ci al., in Monoclonal Antibodies and Cancer Therapy . Alan R. Liss, Inc, 
New York, NY, pp. 77-96 (1985). Techniques described for the production of 

15 single ch.v.i nitibodics can be adapted to produce single chain antibodies to 

immunof.v n:.: polypeptide products of this invention. See T for example, U.S. Patent 
No. 4,94: -7S. 

Various assay formats may utilize the antibodies of the present invention, 
including 'sandwich" immunoassays and probe assays. For example, the antibodies 

20 of the present invention, or fragments thereof, can be employed in various assay 
systems to determine the presence, if any, of LU 103 antigen in a test sample. For 
example, in a first assay format, a polyclonal or monoclonal antibody or fragment 
thereof, or a combination of these antibodies, which has been coated on a solid 
phase, is contacted with a test sample, to form a first mixture. This first mixture is 

25 incubated for a time and under conditions sufficient to form antigen/antibody 

complexr:;. Then, an indicator reagent comprising a monoclonal or a polyclonal 
antibody or a fragment thereof, or a combination of these antibodies, to which a 
signal generating compound has been attached, is contacted with the 
antigen/ar ■ i!*ody complexes to form a second mixture. This second mixture then is 

30 incubated .or a time and under conditions sufficient to form 

antibody/, rr^n/antibody complexes. The presence of LU103 antigen in the test 
sample a:v.i captured on the solid phase, if any, is determined by detecting the 
measurah: * irnal generated by the signal generating compound. The amount of 
LU103 a:: :;":n present in the test sample is proportional to the signal generated. 
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In an alternative assay format, a mixture is formed by contacting: (1) a 
polyclonal antibody, monoclonal antibody, or fragment thereof, which specifically 
binds to LU 103 antigen, or a combination of such antibodies bound to a solid 
support; (2) the test sample; and (3) an indicator reagent comprising a monoclonal 
5 antibody, polyclonal antibody, or fragment thereof, which specifically binds to a 
different LU103 antigen (or a combination of these antibodies) to which a signal 
generating compound is attached. This mixture is incubated for a time and under 
conditions sufficient to form antibody/antigen/antibody complexes. The presence, if 
any, of LU103 antigen present in the test sample and captured on the solid phase is 
10 determined by detecting the measurable signal generated by the signal generating 
compound. The amount of LU103 antigen present in the test sample is proportional 
to the signal generated. 

In another assay format, one or a combination of at least two monoclonal 
antibodies of the invention can be employed as a ccmpetitive probe for the detection 
1 5 of antibodies to LU 103 antigen. For example, LU103 polypeptides such as the 

recombinant antigens disclosed herein, either alone or in combination, are coated on 
a solid phase. A test sample suspected of containing antibody to LU103 antigen then 
is incubated with an indicator reagent comprising a signal generating compound and 
at least one monoclonal antibody of the invention for a time and under conditions 
20 sufficient to form antigen/antibody complexes of either the test sample and indicator 
reagent bound to the solid phase or the indicator reagent bound to the solid phase. 
The reduction in binding of the monoclonal antibody to the solid phase can be 
quantitatively measured. 

In yet another detection method, each of the monoclonal or polyclonal 
25 antibodies of the present invention can be employed in the detection of LU 1 03 
antigens in tissue sections, as well as in cells, by immunohistochemical analysis. 
Cytochemical analysis wherein these antibodies are labeled directly (with, for 
example, fluorescein, colloidal gold, horseradish peroxidase, alkaline phosphatase, 
etc.) or are labeled by using secondary labeled anti-species antibodies (with various 
30 labels as exemplified herein) to track the histopathology of disease also are within the 
scope of the present invention. 

In addition, these monoclonal antibodies can be bound to matrices similar to 
CNBr-activated Sepharose and used for the affinity purification of specific LU103 
polypeptides from cell cultures or biological tissues such as to purify recombinant 
35 and native LU 1 03 proteins. 
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The monoclonal antibodies of the invention a! used for the 

generation of chimeric antibodies for therapeutic use, similar applications. 

The monoclonal antibodies or fragments there v provided individually 
to detect LU103 antigens. Combinations of the monc. intibodies (and 
5 fragments thereof) provided herein also may be used ;is components in a 

mixture or "cocktail" of at least one LU 1 03 antibody o ovention, along with 
antibodies which specifically bind to other LU 103 re;: .:ch antibody having 
different binding specificities. Thus, this cocktail can the monoclonal 

antibodies of the invention which are directed to LU 1 03 ■: iy peptides disclosed 

10 herein and other monoclonal antibodies specific to other ntigenic determinants of 
LU 1 03 antigens or other related proteins. 

The polyclonal antibody or fragment thereof whi :h can be used in the assay 
formats should specifically bind to an LU103 polypeptide nr other LU103 
polypeptides additionally used in the assay. The polyclc : il antibody used 

1 5 preferably is of mammalian origin such as, human, go u ; ibbit or sheep polyclonal 
antibody which binds LU103 polypeptide. Most prefer;-. /. ihe polyclonal antibody 
is of rabbit origin. The polyclonal antibodies used in i^: .ays can be used either 
alone or as a cocktail of polyclonal antibodies. Since the cocktails used in the assay 
formats are comprised of either monoclonal antibodies o; polyclonal antibodies 

20 having different binding specificity to LU 103 polypeptides, they are useful for the 
detecting, diagnosing, staging, monitoring, prognosticating, preventing or treating, 
or determining the predisposition to, diseases and conditions of the lung such as 
lung cancer. 

It is contemplated and within the scope of the pre ^ni invention that LU103 
25 antigen may be detectable in assays by use of a recombir :ir.t antigen as well as by 

use of a synthetic peptide or purified peptide, which pcpiide comprises an amino acid 
sequence of LU 103. The amino acid sequence of such a polypeptide is selected 
from the group consisting of SEQUENCE ID NOS 14-19, and fragments thereof. It 
also is within the scope of the present invention that din :nt synthetic, recombinant 
30 or purified peptides, identifying different epitopes of Li 1 03, can be used in 
combination in an assay for the detecting, diagnosing. ? .-ring, monitoring, 
prognosticating, preventing or treating, or determining t . predisposition to diseases 
and conditions of the lung, such as lung cancer. In thir, .;se, all of these peptides 
can be coated onto one solid phase; or each separate pi*- j may be coated onto 
35 separate solid phases, such as microparticles, and then combined to form a mixture 
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of peptides w : be later used in assays. Furthermore, it is contemplated that 

multiple pept ' ; *;h define epitopes from different antigens may be used for the 
detection, dii . Maging, monitoring, prognosis, prevention or treatment of, or 
determining l' • "disposition to, diseases and conditions of the lung, such as lung 
5 cancer. Pepti i on solid phases or labeled with detectable labels are then 

allowed to co:- ] = ■ : with those present in a patient sample (if any) for a limited 
amount of am - A reduction in binding of the synthetic, recombinant, or 
purified peptics u> the antibody (or antibodies) is an indication of the presence of 
LU103 antigen in the patient sample. The presence of LU103 antigen indicates the 
10 presence of lup.g tissue disease, especially lung cancer, in the patient. Variations of 
assay formats aic known to those of ordinary skill in the art and many are discussed 
herein below. 

In another assay format, the presence of anti-LU103 antibody and/or LU103 
antigen can be delected in a simultaneous assay, as follows. A test sample is 
1 5 simultaneously < omacted with a capture reagent of a first analyte, wherein said 

capture reagent comprises a first binding member specific for a first analyte attached 
to a solid phar ; :md a capture reagent for a second analyte, wherein said capture 
reagent comprises a first binding member for a second analyte attached to a second 
solid phase, to thereby form a mixture. This mixture is incubated for a time and 
20 under conditions sufficient to form capture reagent/first analyte and capture 

reagent/second analyte complexes. These so-formed complexes then are contacted 
with an indicator reagent comprising a member of a binding pair specific for the first 
analyte labeled with a signal generating compound and an indicator reagent 
comprising a member of a binding pair specific for the second analyte labeled with a 
25 signal generating compound to form a second mixture. This second mixture is 
incubated for n time and under conditions sufficient to form capture reagent/first 
analyte/indicator reagent complexes and capture reagent/second analyte/indicator 
reagent complexes. The presence of one or more analytes is determined by detecting 
a signal generated in connection with the complexes formed on either or both solid 
30 phases as an indication of the presence of one or more analytes in the test sample. In 
this assay formal, recombinant antigens derived from the expression systems 
disclosed hercrr. may be utilized, as well as monoclonal antibodies produced from 
the proteins derived from the expression systems as disclosed herein. For example, 
in this assay :.;-:i-:ni. LU103 antigen can be the first analyte. Such assay systems are 
35 described in greater detail in EP Publication No. 0473065. 
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In yet other assay formats, the polypeptides disclosed herein may be utilized 
to detect the presence of antibody against LU103 antigen in test samples. For 
example, a test sample is incubated with a solid phase to which at least one 
polypeptide such as a recombinant protein or synthetic peptide has been attached. 
5 The polypeptide is selected from the group consisting of SEQUENCE ID NOS 14- 
19, and fragments thereof. These are reacted for a time and under conditions 
sufficient to form antigen/antibody complexes. Following incubation, the 
antigen/antibody complex is detected. Indicator reagents may be used to facilitate 
detection, depending upon the assay system chosen. In another assay format, a test 

10 sample is contacted with a solid phase to which a recombinant protein produced as 
described herein is attached, and also is contacted with a monoclonal or polyclonal 
antibody specific for the protein, which preferably has been labeled with an indicator 
reagent. After incubation for a time and under conditions sufficient for 
antibody/antigen complexes to form, the solid phase is separated from the free 

1 5 phase, and the label is detected in either the solid or free phase as an indication of the 
presence of antibody against LU 103 antigen. Other assay formats utilizing the 
recombinant antigens disclosed herein arc contemplated. These include contacting a 
test sample with a solid phase to which at least one antigen from a first source has 
been attached, incubating the solid phase and test sample for a time and under 

20 conditions sufficient to form antigen/antibody complexes, and then contacting the 
solid phase with a labeled antigen, which antigen is derived from a second source 
different from the first source. For example, a recombinant protein derived from a 
first source such as IL is used as a capture antigen on a solid phase, a test 
sample is added to the so-prepared solid phase, and following standard incubation 

25 and washing steps as deemed or required, a recombinant protein derived from a 
different source (i.e., non-IL cq!D is utilized as a part of an indicator reagent which 
subsequently is detected. Likewise, combinations of a recombinant antigen on a 
solid phase and synthetic peptide in the indicator phase also are possible. Any assay 
format which utilizes an antigen specific for LU 103 produced or derived from a first 

30 source as the capture antigen and an antigen specific for LU103 from a different 
second source is contemplated. Thus, various combinations of recombinant 
antigens, as well as the use of synthetic peptides, purified proteins and the like, are 
within the scope of this invention. Assays such as this and others are described in 
U.S. Patent No. 5,254,458, which enjoys common ownership herewith. 
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Other embodiments which utilize various othe 
contemplated and arc within the scope of this inventic 
procedures for immobilizing an immobilizable reactio;. 
charged polymer (described in EP publication 0326 1C 
5 (3406473), can be employed according to the present i 
solution-phase immunochemical reaction. An immobi 
separated from the rest of the reaction mixture by ioni< 
negatively charged poly-anion/immune complex and tl 
positively charged porous matrix and detected by usin: 
10 systems previously described, including those describe 
measurements as described in EPO Publication No. 0 
Also, the methods of the present invention can 
which utilize microparticlc technology including autom 
systems wherein the solid phase comprises a micropan 
15 magnetic). Such systems include those described in, i 
applications Nos. EP 0 425 633 and EP 0 424 634, re 
The use of scanning probe microscopy (SPM) 
technology to which the monoclonal antibodies of the i 
adaptable. In scanning probe microscopy, particularly atomic force microscopy, 
20 the capture phase, for example, at least one of the monoclonal antibodies of the 

invention, is adhered to a solid phase and a scanning probe microscope is utilized to 
detect antigen/antibody complexes which may be present on the surface of the solid 
phase. The use of scanning tunneling microscopy elinv -iscs the need for labels 
which normally must be utilized in many immunoassa* 
25 antigen/antibody complexes. The use of SPM to mom 
can occur in many ways. In one embodiment, one mc 
partner (analyte specific substance which is the monoc : 
invention) is attached to a surface suitable for scanning 
analyte specific substance may be by adsorption to a 
30 solid phase of a plastic or metal surface, following mc 
ordinary skill in the art. Or, covalent attachment of a r 
(analyte specific substance) to a test piece which test \; 
derivatized plastic, metal, silicon, or glass may be utiK 
methods are known to those skilled in the art and inch 
35 irreversibly link specific binding partners to the test pi* 
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example, ion capture 
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: TIP publication No. 

to effect a fast 
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o .i . .. the surface must be activated prior to attaching the specific binding partner. 
/ • v^iyelectrolyte interactions may be used to immobilize a specific binding 
p . it ;L. on a surface of a test piece by using techniques and chemistries. The 
p ; en ed method of attachment is by covalent means. Following attachment of a 
5 s -A binding member, the surface may be further treated with materials such as 
s ' xm. proteins, or other blocking agents to minimize non-specific binding. The 
s -! '!c also may be scanned either at the site of manufacture or point of use to verify 
it suitability for assay purposes. The scanning process is not anticipated to alter the 
specific binding properties of the test piece. 

10 While the present invention discloses the preference for the use of solid 

phases, it is contemplated that the reagents such as antibodies, proteins and peptides 
of the present invention can be utilized in non-solid phase assay systems. These 
assay systems are known to those skilled in the art, and are considered to be within 
the scope of the present invention. 

15 It is contemplated that the reagent employed for the assay can be provided in 

the form of a test kit with one or more containers such as vials or bottles, with each 
c luaiiiLT containing a separate reagent such as a probe, primer, monoclonal antibody 
or a cocktail of monoclonal antibodies, or a polypeptide (e.g. recombinantly, 
synthetically produced or purified) employed in the assay. The polypeptide is 

20 selected from the group consisting of SEQUENCE ID NOS 14-19, and fragments 
thereof. Other components such as buffers, controls and the like, known to those of 
ordinary skill in art, may be included in such test kits. It also is contemplated to 
provide test kits which have means for collecting test samples comprising accessible 
body fluids, e.g., blood, urine, saliva and stool. Such tools useful for collection 

25 ( collection materials") include lancets and absorbent paper or cloth for collecting 
and stabilizing blood; swabs for collecting and stabilizing saliva; cups for collecting 
and stabilizing urine or stool samples. Collection materials, papers, cloths, swabs, 
cups and the like, may optionally be treated to avoid denaturation or irreversible 
^sorption of the sample. The collection materials also may be treated with or 

30 c retain preservatives, stabilizers or antimicrobial agents to help maintain the integrity 
r * the specimens. Test kits designed for the collection, stabilization and preservation 
( : test specimens obtained by surgery or needle biopsy are also useful. It is 
c -memplated that all kits may be configured in two components which can be 
j c vided separately; one component for collection and transport of the specimen and 

35 t ! e other component for the analysis of the specimen. The collection component, for 
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example, can be provided to the open market user while the components for anal; 
can be provided to others such as laboratory personnel for determination of the 
presence, absence or amount of analytc. Further, kits for the collection, stabilize., 
and preservation of test specimens may be configured for use by untrained persor J 
and may be available in the open market for use at home with subsequent 
transportation to a laboratory for analysis of the test sample. 

Ej, coll bacteria (clone 1235095) has been deposited at the American Type 
Culture Collection (A.T.C.C.), 12301 Parklawn Drive, Rockville, Maryland 208:: 2, 
as of 10/7/96, under the terms of the Budapest Treaty and will be maintained for a 
period of thirty (30) years from the date of deposit, or for five (5) years after the last 
request for the deposit, or for the enforceable period of the U.S. patent, whichever is 
longer. The deposit and any other deposited material described herein are provided 
for convenience only, and arc not required to practice the present invention in view 
of the teachings provided herein. The cDNA sequence in all of the deposited 
15 material is incorporated herein by reference. Clone 1235095 was accorded 
A.T.C.C. Deposit No. 98184. 

The present invention will now be described by way of examples, which arc-, 
meant to illustrate, but not to limit, the scope of the present invention. 

20 EXAMPLES 

Fvnmple 1 : Identification of Lung Tissue Lib rary UJ103 Gene-Soecific Clones 
A. Library Comparison of Expressed Sequence Tap* (ESTs,) or Transcript 
Images. Partial sequences of cDNA clone inserts, so-called "expressed sequence 
tags" (ESTs), were derived from cDNA libraries made from lung tumor tissues, lung 
25 non-tumor tissues and numerous other tissues, both tumor and non-tumor and 
entered into a database (LIFESEQ™ database, available from Incyte 
Pharmaceuticals, Palo Alto, CA) as gene transcript images. See International 
Publication No. WO 95/20681. (A transcript image is a listing of the number of 
ESTs for each of the represented genes in a given tissue library. ESTs sharing 
30 regions of mutual sequence overlap are classified into clusters. A cluster is assigned 
a clone number from a representative 5' EST. Often, a cluster of interest can be 
extended by comparing its consensus sequence with sequences of other ESTs wh;..h 
did not meet the criteria for automated clustering. The alignment of all available 
clusters and single ESTs represent a contig from which a consensus sequence is 
35 derived.) The transcript images then were evaluated to identify EST clusters that 
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were representative primarily of the lung tiss^ 
were ranked according to their abundance (oc 
libraries and their absence from background 1 
with low background occurrence were given 
clones 1235095 (SEQUENCE ID NO 1), 12: 
1379417 (SEQUENCE ID NO 3) were idem, 
represented the minimum number of clones tl 
contig and from which the consensus scquem 
NO 4) was derived. 

B. Generation of a Consensus Segue : 
clones, 1235095 (SEQUENCE ID NO 1), 12.. 
1379417 (SEQUENCE ID NO 3), were enter, 
(available from Gene Codes Corporation, Am 
nucleotide alignment (contig map) and then g» 
(SEQUENCE ID NO 4). FIGURE I shows t! 
these clones and their resultant nucleotide cor 
4). FIGURE 2 presents the contig map depic 
SEQUENCE ID NO 2, SEQUENCE ID NO : 



ics. These target clusters then 
: of EST members) in the target 
... Higher abundance clusters 
;tudy priority. Overlapping 
SEQUENCE ID NO 2), and 
, r further study. These 
: : needed to form the LU103 
A-ided herein (SEQUENCE ID 

The nucleotide sequences of EST 
/ 1 (SEQUENCE ID NO 2), and 
■ < t the Sequencher™ Program 
bor, MI, in order to generate a 
ue their consensus sequence 
x lcotide sequence alignment of 
js sequence (SEQUENCE ID NO 
clones SEQUENCE ID NO 1, 
i ming overlapping regions of the 
Jc sequence (SEQUENCE ID NO 



LU103 gene and the resultant consensus nuckv 
4) of these clones in a graphic display. Following this, a three-frame translation was 
performed on the consensus sequence (SEQUENCE ID NO 4). The third forward 
frame was found to have an open reading frame encoding a 93 residue amino acid 
sequence, which is presented as SEQUENCE i 

C. Specificity of Expression of ESTs 
Sequence. The consensus sequence, generate 
to the entire updated LDFESEQ™ database (Sir 
search tool. ESTs corresponding to the conse. 



) NO 14. 
responding to Consensus 
section B, supra , was compared 
:nbcr 1997) using the BLAST 
sequence were found in 66.6% 



(22 of 33) of lung libraries and 0.2% (1 of 45(< of non-lung libraries. Therefore, 



30 



the consensus sequence or fragment thereof v. 
often in lung than non-lung tissues. 



und more than 302 times more 



Example 2: Sequencing of 
The DNA sequence of clone 123509J 
the LUI03 gene contig was determined using 



;i >EST-Specific Clope? 
h comprises the 5*-most EST of 
i*;y termination sequencing with 
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:rminators following known methods (SEQUENCE ID NO 5). (F. Sanger et 
' NAS U.S.A . 74:5463 (1977)). 

Because the pINCY vector (Life Technologies, Gaithersburg, MD) contains 
. rsal priming sites just adjacent to the 3' and 5' ligation junctions of the inserts, 
5 ,v;imately 300 bases of the insert were sequenced in both directions using the 

r . . versal primers depicted in SEQUENCE ID NO 8 and SEQUENCE ID NO 9 
i lined from New England Biolabs, Beverly, MA and Applied Biosystems Inc, 

:.:cr City, CA, respectively). The sequencing reactions were run on a 
j-. ;\ acrylamide denaturing gel, and the sequences were determined by an Applied 
10 r»i;>systems 377 Sequencer (available from Applied Biosystems, Foster City, CA) or 
oilier sequencing apparatus. Additional sequencing primers, SEQUENCE ID NO 10 
and SEQUENCE ED NO 1 L were designed from sequence information determined 
bv the initial sequencing reactions near the 3'-ends of the two DNA strands. These 
p imers then were used to determine the remaining DNA sequence of the cloned 
15 i i sort from each DNA strand, as previously described. 

Example 3: Nucleic Acid Preparation 
A. RNA Extraction from Tissue . Total RNA was isolated from solid lung 
tissues or cells and from non-lung tissues. Various methods were utilized, including 
20 but not limited to the lithium chloride/urea technique, known and described in the art 
(Kato et al„ J.Virol. 61:2182-2191,(1987)), Ultraspec™ (Biotecx Laboratories, 
Inc., Houston Texas), and TRIzoI™ (Life Technologies, Inc., Gaithersburg, MD). 

For northern blot analysis, the tissue was placed in a sterile conical tube on 
ice and 10-15 volumes of 3 M LiCl, 6 M urea, 5 mM EDTA, 0.1 M 0- 
25 iiiercaptoethanol, 50 mM Tris-HCl (pH 7.5) were added. The tissue was 
homogenized with a Polytron® homogenizer (Brinkman Instruments, Inc., 
Wcstbury, NY) for 30-50 seconds on ice. The solution was transferred to a 15 ml 
plastic centrifuge tube and placed overnight at -20°C. The tube was centrifuged for 
^' "» min at 9,000 x g at 0-4°C, and the supernatant was immediately decanted. Then, 
30 ! ) ml of 3 M LiCl were added, the tube was vortexed for 5 sec and centrifuged for 
•! " min at 1 1,000 x g at 0-4°C, and the above decanting, resuspension in LiCl, and 
c .r.trifugation steps were repeated. The final pellet was air-dried and resuspended in 
t.1 of 1 mM EDTA, 0.5% SDS, 10 mM Tris (pH 7.5). Next, a 20 \i\ volume of 
: icinase K (20 mg/ml) was added, and the solution incubated for 30 min at 37°C 
35 v. ith occasional mixing. One-tenth volume (0.22-0.25 ml) of 3 M NaCl was added 
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and the solution was vortexed before transfer into another tube which contained il 
of phenol/chloroform/isoamyl alcohol (PCI). The tube was vortexed for 1-3 sc ..! 
centrifuged for 20 min at 3,000 x g at 10°C, after which the PCI extraction was 
repeated twice, then followed by two similar extractions with chloroform/isoam 
5 alcohol. The final aqueous solution was transferred to a pre-chilled 1 5 ml core> 
glass tube containing 6 ml of 100% absolute ethanol, the tube was covered with 
parafilm and placed at -20°C overnight. The tube was centrifuged for 30 min at 
10,000 x g at 0-4°C, and the ethanol supernatant was decanted immediately. Th 
RNA pellet was washed four times with 10 ml of 75% ice-cold ethanol, followe ; 
10 each time by ccntrifugation at 10,000 x g for 10 min. The final pellet was air-di 

for 15 min at room temperature. The RNA was suspended in 0.5 ml of 10 mM H is 
(pH 7.6), 1 mM EDTA, and its concentration was determined 
spcctrophotomctrically. RNA samples were aliquoted and stored at -70°C as eth mol 
precipitates. 

15 The quality of the RNA was determined by agarose gel electrophoresis ( 

Example 5) and staining with 0.5 ^g/ml ethidium bromide for one hour RNA 
samples that did not contain intact 28S/18S rRNAs were excluded from the stuc .< . 

Alternatively, for RT-PCR analysis, 1 ml of Ultraspec RNA reagent wa:. 
added to 120 mg of pulverized tissue in a 2.0 ml polypropylene microfuge tube, 

20 homogenized with a Polytron® homogenizer (Brinkman Instruments, Inc., 

Westbury, NY) for 50 sec and left on ice for 5 min. 0.2 ml of chloroform was then 
added to each sample, followed by vortexing for 15 sec. The sample was left in ice 
for another 5 min, followed by centrifugation at 12,000 x g for 15 min at 4°C. The 
upper layer was collected and transferred to another RNase-free 2.0 ml microfu; e 

25 tube. An equal volume of isopropanol was added to each sample, and the solut ; :>n 
was placed on ice for 10 min. The sample was centrifuged at 12,000 x g for 10 nin 
at 4°C, and the supernatant was discarded. The remaining pellet was washed tw ice 
with cold 75% ethanol, rcsuspended by vortexing, and the resuspended materia 1 v/as 
then re-pelleted by centrifugation at 7500 x g for 5 min at 4°C. Finally, the RN, \ 

30 pellet was dried in a speedvac for at least 5 min and reconstituted in RNase-free 
water. 

B. RNA Extraction from Blood Mononuclear Cells . Mononuclear cells are 
isolated from blood samples from patients by centrifugation using Ficoll-Hypac \z as 
follows. A 10 ml volume of whole blood is mixed with an equal volume of RP /I! 
35 Medium (Life Technologies, Gaithersburg, MD). This mixture is then underlav id 
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with 10 ml of Ficoll-Hypaquc (Pharmac 
minutes at 200 x g. The buffy coat cont; 
diluted to 50 ml with DuIbecco\s PBS (L 
the mixture centrifuged for 10 minutes a 

5 pellet is rcsuspended in Dulbecco's PBS 
RNA is prepared from the isolate 
Kato ct aL, supra . Briefly, the pelleted r 
ml volume and then are resuspended in 1 
3M LiCl, 6M urea, 5mM EDTA, 0. 1 M 2 

10 7.5). The resulting mixture is homogeni: 
homogenate is spun at 8,000 RPM in a B 
4°C. The pellet is resuspended in 10 ml . 
10,000 RPM in a Beckman J2-21M rotor 
resuspending and pelleting steps then are 

15 of 1 mM EDTA, 0.5% SDS, 10 miM Tri 
vortexing and then it is incubated at 37°C 
volume of 3M NaCl then is added and th 
by two cycles of extraction with phenol/ • 
one extraction with chloroform/ isoamyl 

20 addition of 6 ml of ethanol followed by o 
precipitated RNA is collected by centrifu; 
ethanol. The pelleted RNA is then dissol 
7.5). 

Non-lung tissues are used as neg 
25 purified from total RNA by using commt 
cellulose spin columns (RediCol™ from 
isolation of poly-adenylated RNA. Tola. 
(5M guanidine thiocyanate, 0. 1M EDTA 
protection assay. 
30 C. RNA Extraction from polvso 

mixed with 2.5 volumes of 0.8 M sucro: 
MgCl 2 , 50 mM Tris-HCl, pH 7.4) soluti' 
The tissue is homogenized in a Tcflon-g 
100-200 rpm followed by six strokes in 
35 Mechler, Methods in Enzvmologv 1 52:2 



.itaway, NJ) and centrifuged for 30 
he mononuclear cells is removed, 
-hnologies, Gaiihersburg, MD) and 
g. After two washes, the resulting 
:al volume of 1 ml. 
;onuclear cells as described by N. 
uclear cells are brought to a final of 1 
L of PBS and mixed with 2.5 ml of 
aptoethanol, 50mM Tris-HCl (pH 
iiid incubated at -20°C overnight. The 
;;:an J2-21M rotor for 90 minutes at 0- 
! .iCl by vortexing and then spun at 
Mrifuge for 45 minutes at 0-4°C. The 
. .ued. The pellet is resuspended in 2 ml 
7.5) and 400 ^g Proteinase K with 
W minutes with shaking. One tenth 
jxed mixture. Proteins are removed 
roforrn/ isoamyl alcohol followed by 
..hoi. RNA is precipitated by the 
.■might incubation at -20°C. After the 
nion, the pellet is washed 4 times in 75% 
cd in ImM EDTA. lOmM Tris-HCl (pH 

; controls. The mRNA can be further 
:!!y available kits such as oligo dT 
irmacia, Uppsala, Sweden) for the 
inRNA can be dissolved in lysis buffer 
H 7.0) for analysis in the ribonuclease 

. Tissue is minced in saline at 4°C and 
\ TK IW M (150 mM KC1, 5 mM 
staining 6 mM 2-mercaptoethanol. 

hotter homogenizcr with five strokes at 
:nce homogenizer, as described by B. 

118 (1987). The homogenate then is 
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nfuged at 12,000 x g for 15 min at 4°C to sediment the nuclei. The polysomes 
eclated by mixing 2 ml of the supernatant with 6 ml of 2.5 M sucrose in TK I50 M 
i iayering this mixture over 4 ml of 2.5 M sucrose in TK IM) M in a 38 ml 
vnllomer tube. Two additional sucrose TK I50 M solutions are successively 



;:. nnd layer of 6 ml of 1.3 M sucrose. The polysomes are isolated by centrifuging 
. gradient at 90,000 x g for 5 h at 4°C. The fraction then is taken from the 1 .3 M 
: ;rose/2.05 M sucrose interface with a siliconized pasteur pipette and diluted in an 
v qua! volume of TE (10 mM Tris-HCl, pH 7.4, 1 mM EDTA). An equal volume of 

10 r *r j C SDS buffer (1% SDS, 200 mM NaCl, 20 mM Tris-HCl, pH 7.4) is added and 
the solution is incubated in a boiling water bath for 2 min. Proteins next are digested 
with a Proteinase- K digestion (50 mg/ml) for 15 min at 37°C The mRNA is 
purified with 3 equal volumes of phenol-chloroform extractions followed by 
precipitation with 0. 1 volume of 2 M sodium acetate (pH 5.2) and 2 volumes of 

15 " '*)()% ethanol at -20°C overnight. The precipitated RNA is recovered by 

■ c ntrifugation at 12,000 x g for 10 min at 4°C. The RNA is dried and resuspended 
* ; TE (pH 7.4) or distilled water. The resuspended RNA then can be used in a slot 
blot or dot blot hybridization assay to check for the presence of LU103 mRNA (see 
Example 6). 

20 The quality of nucleic acid and proteins is dependent on the method of 

preparation used. Each sample may require a different preparation technique to 
maximize isolation efficiency of the target molecule. These preparation techniques 
are within the skill of the ordinary artisan. 

25 Example 4: Rihonuclease Protection Assay 

A. Synthesis of Labeled Complementary RNA fcRNA) Hybridization Probe 
and Unlabeled Sense Strand. A plNCY plasmid containing the LU103 gene cDNA 
sequence insert (clone 1235095), flanked by opposed SP6 and T7 polymerase 
promoters, was purified using Qiagen Plasmid Purification Kit (Qiagen, 

30 Chats worth, CA). 10 ^g of the resulting plasmid were then cut with 10 U Ddel 

restriction enzyme for I h at 37°C. The cut plasmid was purified using QIAprep kits 
i i qiagen, Chatsworth, CA) and used for the synthesis of antisense transcript which is 
labeled with 6.3 ^iM (alpha 32 P) UTP (800 Ci/mmol)(Amersham Life Sciences, Inc. 
.* . lington Heights, IL) from the SP6 promoter using the Riboprobe® nLYilEQ. 

35 Transcription System (Promega Corporation, Madison, WI) according to the 



- ed onto the extract fraction; a first layer of 13 ml 2.05 M sucrose followed by a 
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supplier's instructions. To generate the sense strand. 10 jlir of the purified ph;... 
were cut with restriction enzymes I0U Xbal and 10 U NotI, and transcribed a 
above from the T7 promoter. Both sense and antisense strands were isolated I v .n 
column chromatography. Unlabeled sense strand was quantitated by UV absor: n 
5 at 260 nm. 

R. Hybridization of Labeled Probe to Target . Frozen tissue was pulveri: J 
to powder under liquid nitrogen and 100-500 mg were dissolved in 1 ml of lysis 
buffer as available as a component of the DirectProtect™ Lysate RNase Protectic. kit 
(Ambion, Inc., Austin, TX). Further dissolution was achieved using a tissue 
10 homogenizer. In addition, a dilution series of a known amount of sense strand ii 
mouse liver lysate was made for use as a positive control. Finally, 45 \il of 
solubilized tissue or diluted sense strand was mixed directly with 1 x 10 5 cpm of 
radioactively labeled probe (1.5 x 10 9 cpm/^g) in 5 \i\ of lysis buffer. Hybridiza u>n 
was allowed to proceed overnight at 37°C. 
15 r RNase Digestion . RNA that was not hybridized to probe was removt i 

from the reaction as per the Direct Protect™ protocol using a solution of RNase A 
and RNase Tl for 30 min at 37°C followed by removal of RNase by Proteinase 
digestion in the presence of sodium sarcosyl. Hybridized fragments protected fn -.n 
digestion were then precipitated by the addition of an equal volume of isopropanol 
20 and placed at -70°C for 3 h. The precipitates were collected by centrifugation at 
12,000 xg for 20 min. 

P. Fragment Analysis . The precipitates were dissolved in denaturing gel 
loading dye (80% formamide, 10 mM EDTA (pH 8.0), 1 mg/ml xylene cyanol, i 
mg/ml bromophenol blue), heat denatured, and electrophoresed in 6% 
25 polyacrylamide TBE, 8 M urea denaturing gels. The gels were imaged and anal> .:ed 
using the STORM™ storage phosphor autoradiography system (Molecular 
Dynamics, Sunnyvale, CA). Quantitation of protected fragment bands, expressc in 
femtograms (fg), was achieved by comparing the peak areas obtained from the test 
samples to those from the known dilutions of the positive control sense strand (s • ; 
30 Section B, supra ). In addition, the concentration of DNA in the lysate was assa\ il 
to estimate the number of cells in the test sample lysates. The results are express \ 
in molecules of LU1 03 RNA/cell and as a image rating score (Table 1). High le^ i 
expression of mRN A corresponding to a sequence selected from the group 
consisting of SEQUENCE ID NO 1 , SEQUENCE ID NO 2, SEQUENCE ID N 3, 
35 SEQUENCE ID NO 4, SEQUENCE ID NO 5, and fragments or complements 
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thereof, indicated the presence of LU1 
lung tissue disease or condition, such r 



A(s), suggesting a diagnosis of a 
:*nccr. 



10 



15 



20 



25 



Tissue 


ID Number 


llNA/ccll 


Score* 


Normal Lung 


C157 


iU 


+ 




C007G 


7 


+ 




C027R 


8 


+ 




C016R 


2 


+ 




C135R 


0.2 


+ 


Malignant Lung 


C011G 


>46 


3+ 




C023G 


U.3 


+ 




C012G 


0 






C033R 


>87 


3+ 




C030 


0 




Normal Lung 


C005R 


0 




Malignant Lung 


C037G 


0 




Normal Colon 


C027G 


(.) 





^Samples with no detectable protected IV: • 
detectable protected fragment (the fg value? 
scored as "+"; samples with detectable pr : . 
10 fold above the std curve were scored as 
protected fragment having fg values 10 foiJ 
as "3+". 



u were scored as samples with 
which were within the std curve) were 
*d fragment and having fg values 2 to 
r"; and samples with detectable 
<r more above the std curve were scored 



Example 5: Northern Blotting 



The northern blot technique was use 
in a complex population of RNA using agar 
hybridization. Briefly, 5-10 \±g of total R> 
(il of a solution containing 40 mM morphil 
7.0), 10 mM sodium acetate, 1 mM EDTA, 
formamide for 15 min at 65°C The denau 
buffer (50% glycerol, 1 mM EDTA, 0.4% i 
and loaded into a denaturing 1 .0% agarose 
10 mM sodium acetate, 1 mM EDTA and 
electrophoresed at 60 V for 1 .5 h and rhv;: 
stained with 0.5 [ig/ml of ethidium broir.:.'. 
with UV light to visualize ribosomal RNA 
onto nylon membranes (Brightstar-Phv . 
using the downward alkaline capillary tnm 



1 to identify a specific size RNA species 
:;e gel electrophoresis and nucleic acid 

(see Example 3) was incubated in 15 
-propanesulfonic acid (MOPS) (pH 
-.2 M formaldehyde, 50% v/v 

! RNA was mixed with 2 \i\ of loading 
romophenol blue, 0.4% xylene cyanol) 

1 containing 40 mM MOPS (pH 7.0), 

M formaldehyde. The gel was 

n RNAse free water. Gels were 
RNAse free water and illuminated 

nds. RNA was transferred from the gel 
ion, Inc., Austin, TX) for 1.5 hours 

i method (Chomczynski, Anal. 
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Bi .101:134-139, 1992). The filter was rinsed with IX SSC, and RNA was 

cn to the filter using a Stratalinkcr (Stratagcne, Inc., La Jolla, CA) on the 

am linking mode and dried for 15 min. The membrane was then placed into a 
hy- mi lube containing 20 ml of preheated prehybridization solution (5X SSC, 

5 50' :nide, 5X Denhardt's solution. 100 |ig/ml denatured salmon sperm DNA) 

anc :ived in a 42°C hybridization oven for at least 3 hr. While the blot was 
pre -'ing, a "P-labeled random-primed probe was generated using the LU103 
insc : . anient (obtained by digesting clone 1 235095 with Xbal and Not!) using 
Ran Primer DNA Labeling System (Life Technologies, Inc.. Gaithersburg, MD) 

10 ace. 1 to the manufacturer's instructions. Half of the probe was boiled for 10 
min, quick chilled on ice and added to the hybridization tube. Hybridization was 
carr nut at 42°C for at least 12 hr. The hybridization solution was discarded and 
the : ' :i was washed in 30 ml of 3X SSC, 0, 1 % SDS at 42°C for 15 min, followed 
by / pi of 3X SSC, 0.1% SDS at 42°C for 15 min. The filter was wrapped in 

15 san : '.]), exposed to Kodak XAR-Omat film for 8-96 hr, and the film was 
devt for analysis. 

' csults of the analysis of RNA quality using an cthidium bromide stained 
agar .-so gel and the corresponding northern blot using LU103 probe hybridized to 
RN As from non-lung tissues and lung tissues are shown in Figure 3A and Figure 

20 3B, respectively. The positions of RNA size standards (in kb) are shown to the left 
of each panel. As shown in Figure 3 A, the LU103 probe detected a 0.75 kb band in 
RNA isolated from lung tissue (lane 6) but did not detect a band in the RNA isolated 
from 1 1 other tissues. Figure 3B shows that the LU 103 probe detected a 0.75 kb 
ban* :.: ;!ie RNA of 5/6 normal lung tissues and 4/6 lung cancer tissues. The 0.75 

25 kb I* r.-.l was not detected in the lung sample of lane 3, nor in the seven non-lung 
sam in lanes 4-10 (which contained colon, colon, lung, lung, ovary, prostate, 
and ;plecn tissue, respectively). Lane 2 was blank. 

Example 6: Dot Blot/Slot Blot 
30 I>ot and slot blot assays are quick methods to evaluate the presence of a 

spc, •sucleic acid sequence in a complex mix of nucleic acid. To perform such 
ass: t-p to 50 \ig of RNA is mixed in 50 \i\ of 50% formamide, 7% 
for: : Iwdc, IX SSC, incubated 15 min at 68°C, and then cooled on ice. Then, 
1 0t 20X SSC is added to the RNA mixture and loaded under vacuum onto a 

35 man rltl apparatus that has a prepared nitrocellulose or nylon membrane. The 
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membrane is soaked in water, 20X SSC for 1 hour, placed on two sheets of 20> 
SSC prcwct Whatman #3 filter paper, and loaded into a slot blot or dot blot vacv u 
manifold apparatus. The slot blot is analyzed with probes prepared and labeled ;i 
described in Example 4, supra . Detection of mRNA corresponding to a sequenc; 
5 selected from the group consisting of SEQUENCE ID NO 1, SEQUENCE ID N 2, 
SEQUENCE ID NO 3, SEQUENCE ID NO 4, SEQUENCE ID NO 5, and 
fragments or complements thereof, is an indication of the presence of LU103, 
suggesting a diagnosis of a lung tissue disease or condition, such as lung cancer. 

Other methods and buffers which can be utilized in the methods described in 
10 Examples 5 and 6 ? but not specifically detailed herein, are known in the art and are 
described in J. Sambrook ct al. supra . 

Example 7: In Situ Hybridization 
This method is useful to directly detect specific target nucleic acid sequences 
15 in cells using detectable nucleic acid hybridization probes. 

Tissues are prepared with cross-linking fixative agents such as 
paraformaldehyde or glutaraldehyde for maximum cellular RNA retention. See, L. 
Angerer et aL Methods in Cell Biol . 35:37-71 (1991). Briefly, the tissue is placed 
in greater than 5 volumes of 1 % glutaraldehyde in 50 mM sodium phosphate, pH 
20 7.5 at 4°C for 30 min. The solution is changed with fresh glutaraldehyde solution 
( 1 % glutaraldehyde in 50mM sodium phosphate, pH 7.5) for a further 30 min 
fixing. The fixing solution should have an osmolality of approximately 0.375% 
NaCl. The tissue is washed once in isotonic NaCl to remove the phosphate. 

The fixed tissues then are embedded in paraffin as follows. The tissue is 
25 dehydrated though a series of cthanol concentrations for 15 min each: 50% (twice), 
70% (twice), 85%, 90% and then 100% (twice). Next, the tissue is soaked in two 
changes of xylene for 20 min each at room temperature. The tissue is then soaked in 
two changes of a 1 : 1 mixture of xylene and paraffin for 20 min each at 60°C; and 
then in three final changes of paraffin for 15 min each. 
30 Next, the tissue is cut in 5 Jim sections using a standard microtome and 

placed on a slide previously treated with a tissue adhesive such as 3- 
aminopropyltriethoxysilane. 

Paraffin is removed from the tissue by two 10 min xylene soaks and 
rchydrated in a series of ethanol concentrations: 99% twice, 95%, 85%, 70%, 51)%, 
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10 



15 



20 



25 



30 



30%, and then distilled water twice. Thi 
10 min and pcrmeabilized with 2 |ig/ml ' 
Labeled riboprohes transcribed fi 
4) arc hybridized to the prepared tissue s 
3X standard saline extract and 50% forn 
washing in 2X standard saline citrate anc 
with 100 ng/ml RNase A at 37°C for 30 
illumination with ultraviolet (UV) light u 
cytoplasm is indicative of LU103 mRNA 
visualized by autoradiography. 



as are pre-treated with 0.2 M HC1 for 
:sc-K at 37°Cfor 15 min. 
LU103 gene plasmid (see Example 
and incubated overnight at 56°C in 
Excess probe is removed by 
iormamide followed by digestion 
? robe fluorescence is visualized by 
. microscope. Fluorescence in the 
amatively, the sections can be 



Fxnmple8: Revers e T >;in<;cription PCR 



-specific primers are designed to detect 
. -se transcription PGR using methods 
,?ntial procedure that performs both RT 
■cedure is performed in a 200 |il 
2-Hydroxyethyl]glycine), pH 8.15, 



A One Step RT-PCR Assay . T;n< 
the above-described target sequences by ; 
known in the art. One step RT-PCR is a 
and PCR in a single reaction mixture. T! . 
reaction mixture containing 50 mM (N,I 
81.7 mMKOAc, 33.33 mMKOH, 0.01 r.u:.ml bovine serum albumin, 0.1 mM 
ethylene diaminctetraacetic acid, 0.02 mg/ml NaN 3f 8% w/v glycerol, 150 ^iM each 
of dNTP, 0.25 \iM each primer, 5U rTth polymerase, 3.25 mM Mn(OAc) 2 and 5 \i\ 
of target RNA (see Example 3). Since RNA and the rTth polymerase enzyme are 
unstable in the presence of Mn(OAc) 2 , the Mn(OAc) 2 should be added just before 
target addition. Optimal conditions for cDN A synthesis and thermal cycling readily 
can be determined by those skilled in the ;r The reaction is incubated in a Perkin- 
Elmer Thermal Cycler 480. Optimal cor.:.:;:, -ns for cDNA synthesis and thermal 
cycling can readily be determined by thos • billed in the art. Conditions which may 
be found useful include cDNA synthesis ... F-70°C for 15-45 min and 30-45 
amplification cycles at 94°C, 1 min; 55°-7<T C, l min; 72°C, 2 min. One step RT- 
PCR also may be performed by using a c 1 /. :•. rrszymc procedure with Taq polymerase 



35 



and a reverse transcriptase enzyme, such ; 

R. Traditional RT-PCR . A tradi 
performed using the method of K.Q. Hu 
Briefly, 0.5 ng of extracted mRNA (see I 
|il reaction mixture containing IX PCR i 
mM dNTP, 20 ILRNasin, 2.5 jiM randor 



. ) MLV or AMV RT enzymes, 
i! iwo-step RT-PCR reaction was 
L Virology 181:721-726 (1991). 
::ple 3) was reverse transcribed in a 20 
fcr (Perkin-EImer), 5 mM MgCl 2 , 1 

.earners, and 50 U MMLV (Moloney 
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mi ukemia virus) reverse transcriptase (RT). Reverse transcription was 
pe :! at room temperature for 10 min. 42°C for 60 min in a PE-480 thermal 
cy: i lowed by further incubation at 95°C for 5 min to inactivate the RT. PCR 
formed using 2 jil of the cDNA reaction in a final PCR reaction volume of 50 
5 \il . ning 10 mM Tris-HCl (pH 8.3), 50 mM KC1, 2 mM MgCI 2 , 200 ^iM 
dN ' 5 \iM of each sense and antisense primer (SEQUENCE ID NO 12 and 
SE \ r CE ID NO 1 3, respectively), and 2.5 U of Taq polymerase. The reaction 
wa .-abated in an MJ Research Model PTC-200 as follows: 40 cycles of 
am/!:fication (94°C, 20 sec; 58°C 30 sec; 72°C, 30 sec); a final extension (72°C, 10 

10 mii : and a soak at 4°C. 

C. PCR Fragment Analysis . The correct products were verified by size 
determination using gel electrophoresis. After the gel was stained with ethidium 
bromide (0.5 |ig/ml in TBE buffer) for 15 minutes and destained in water for 10 
miru;i*:> s it was visualized by UV illumination. Figure 4A r lanes 1-5, shows a DNA 

15 bar : v, hich is indicative of a P2 microglobulin-specific PCR product in both normal 
lun rsuc (lanes 1, 2, and 4) and cancerous lung tissue (lanes 3 and 5). Lanes 8- 
12. h contained RNA from the same specimens, shows a 269 base pair LU103- 
spe Jhic PCR amplification product in both normal lung tissue (lanes 8 t 9, and 1 1) 
and cancerous lung tissue (lanes 10 and 12). Lane 7 shows that LU103-specific 

20 amplification of DNA with the same primers gives rise to an approximately 650 base 
pair fragment; thus, some of the lung RNA specimens contained DNA. Figure 4B 
shows that, in contrast to the LU103 RNA-specific amplification product obtained 
from lung tissue RNAs (lanes 1-2), the same conditions of amplification did not 
produce this RNA-specific product from RNAs obtained from prostate tissue (lanes 

25 5-9). breast tissue (lanes 10-14), or colon tissue (lanes 15-19). However, many of 
the ^ RNAs did contain sufficient DNA to produce a DNA-specific product. Lane 4 
shews the DNA-specific amplification product control. Detection of a product 
comprising a sequence selected from the group consisting of SEQUENCE ID NO 1, 
SE y JENCE ID NO 2, SEQUENCE ID NO 3, SEQUENCE ID NO 4, SEQUENCE 

30 ID NO 5, and fragments or complements thereof, indicated the presence of LU103 
ml * : \(s), suggesting a diagnosis of a lung tissue disease or condition, such as lung 
cai.Lcr. 
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Example 9: OH-PCR 

A. Probe selection and Labeling . Target-specific primers and probes arc 
designed to detect the above-described target sequences by oligonucleotide 
hybridization PCR. International Publication Nos WO 92/10505, published 25 June 

5 1992, and WO 92/1 1388, published 9 July 1992, teach methods for labeling 
oligonucleotides at their 5' and 3' ends, respectively. According to one known 
method for labeling an oligonucleotide, a label-phosphoramidite reagent is prepared 
and used to add the label to the oligonucleotide during its synthesis. For example, 
see N. T. Thuong et aL, Tet. Letters 29(46):5905-5908 (1988); or J. S. Cohen et 

10 al., published U.S. Patent Application 07/246,688 (NTIS ORDER No. PAT-APPL- 
7-246,688) (1989). Preferably, probes arc labeled at their 3' end to prevent 
participation in PCR and the formation of undesired extension products. For one 
step OH-PCR, the probe should have aT„ at least 15°C below the T M of the primers. 
The primers and probes arc utilized as specific binding members, with or without 

15 detectable labels, using standard phosphoramidite chemistry and/or post-synthetic 
labeling methods which arc well-known to one skilled in the art. 

B. One Step Olieo Hybridization PCR . OH-PCR is performed on a 200 ^1 
reaction containing 50 mM (N,N,-bis[2-Hydroxyethyl]glycine), pH 8.15, 81.7 mM 
KOAc, 33.33 mM KOH, 0.01 mg/ml bovine serum albumin, 0.1 mM ethylene 

20 diaminetetraacetic acid, 0.02 mg/ml NaN 3i 8% w/v glycerol, 150 nM each of dNTP, 

0.25 \iM each primer, 3.75 nM probe, 5U rTth polymerase, 3.25 mM Mn(OAc) 2 
and 5 (il blood equivalents of target (see Example 3). Since RNA and the rTth 
polymerase enzyme are unstable in the presence of Mn(OAc) 2 , the Mn(OAc) : should 
be added just before target addition. The reaction is incubated in a Perkin-Elmer 

25 Thermal Cycler 480. Optimal conditions for cDNA synthesis and thermal cycling 
can be readily determined by those skilled in the art. Conditions which may be 
found useful include cDNA synthesis (60°C, 30 min), 30-45 amplification cycles 
(94°C, 40 sec; 55-70°C, 60 sec), oligo-hybridization (97°C, 5 min; 15°C, 5 min; 
15°C soak). The correct reaction product contains at least one of the strands of the 

30 PCR product and an internally hybridized probe. 

C. OH-PCR Product Analysis . Amplified reaction products are detected on 
an LCx® analyzer system (available from Abbott Laboratories, Abbott Park, 1L). 
Briefly, the correct reaction product is captured by an antibody labeled microparticle 
at a capturable site on either the PCR product strand or the hybridization probe, and 

35 the complex is detected by binding of a detectable antibody conjugate to either a 



• 
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10 



detectable site on the probe or the PCR strand. Onh 
strand hybridized with the internal probe is detcciah! 
then is indicative of the presence of LU103 mRNA. 
lung disease or condition, such as lung cancer. 

Many other detection formats exist which c; : 
those skilled in the art to detect the presence of ampli. 
derived nucleic acid sequences including, but not lirn 
(LCR, Abbott Laboratories, Abbott Park, IL); Q-beta . 
Naperville, Illinois), branched chain reaction (Chiron, 
displacement assays (Becton Dickinson, Research Tri 



ox containing a PCR 
'Section of this complex 
;;ing a diagnosis of a 

! and/or modified by 
non-amplified LU103- 
. ligase chain reaction 
case (Gene-Trak™, 
rryville, CA) and strand 
;e Park, NC). 



The LU103 polynucleotides which are provide 
are useful as markers of lung tissue disease, especially 
upon the appearance of this marker in a test sample su 
1 5 scrum, can provide low cost, non-invasive, diagnostic 
physician to make a diagnosis of cancer, to help selet : 
monitor the success of a chosen therapy. This markc 
state, altered in a disease state, or be a normal protein t 
an inappropriate body compartment. 



!.k! discussed hereinabove 
lung cancer. Tests based 

is tissue, blood, plasma or 
.formation to aid the 
•jiapy protocol, or to 
. be elevated in a disease 
rhe lung which appears in 
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SEQUENCE LISTING 



«1J GENERAL INFORMATION 



. i; APPLICANT:. ABBOTT LABORATORIES 



TTTLE OF THE INVENTION: REAGENTS AND METHODS USEFUL 
FOR DETECTING DISEASES OF THE LUNG 



NUMBER OF SEQUENCES: 19 

iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Abbott Laboratories 

(B) STREET: 100 Abbott Park Road 
£C) CITY: Abbott Park 

(D) STATE: XL 

(E) COUNTRY: USA 

(F) ZIP : 60064-3500 

/V) COMPUTER READABLE FORM: 
(A) MEDIUM TYPE: Diskette 
(3) COMPUTER: IBM Compatible 

(C) OPERATING SYSTEM : DOS 

£D) SOFTWARE: FastSEQ for Windows Version 2.0 

■vi) CURRENT APPLICATION DATA: 
•;A) APPLICATION NUMBER: 
(3) FILING DATE: 
(C) CLASSIFICATION: 

tvii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 



(viii) ATTORNEY /AGENT INFORMATION: 

(A) NAME: Becker, Cheryl L. 

(B) REGISTRATION NUMBER : 35,441 

(C) REFERENCE /DOCKET NUMBER: 5 99 7. PC. 01 

■'ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 847/935-1729 

(B) TELEFAX: 847/938-2623 

(C) TELEX: 



(2) INFORMATION FOR SEQ ID NO:l: 

ii) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 269 base pairs 
(3) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



:i) SEQUENCE DESCRIPTION: SEQ ID NO:l: 




60 
120 
180 
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AGTTGGCACC TTTACCTCTG GACAACATTC TTCCCTTTAT GGATCCATTA AACCTTCTTC ' 7:4 0 
TGAAAACTCT GGGCATTTCT GTTGAGCAC 2 6 9 



(2) INFORMATION FOR SEQ ID NO : 2 : 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 263 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO:2: 



TGCTACTGCC TTCCTCATCA ACAAAGTGCC CCTTCCTGTT GACAAGTTGG CACCTTTACC 60 

TCTGGACAAC ATTCTTCCCT TTATGGATCC ATTAAAGCTT CTTCTGAAAA CTCTGGGCAT 120 

TTCTGTTGAG CACCTTGTGG AGGGGCTAAG GAAGTGTGTA AATGAGCTGG GACCAGAGGC 180 

TTCTGAAGCT GTGAAGAAAC TGCTGGAGGC GCTATCACAC TTGGTGTG A C ATCAAGATAA 24 0 

AGAGCGGAGG TGGATGGGGA TGG 263 



(2) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 225 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 



AAGGAAGTGT GTAAATGAGC TGGGACCAGA GGCTTCTGAA GCTGTGAAGA AACTGCTGGA 6 0 

GGCGCTATCA CACTTGGTGT GACATCAAGA TAAAGAGCGG AGGTGGATGG GGATGGAAGA 120 

TGATGCTCCT ATCCTCCCTG CCTGAAACCT GTTCTACCAA TTATAGATCA AATGCCCTAA 180 

AATGTAGTGA CCCGTGAAAA GGACAAATAA AGCAATGAAT ACATT 225 



(2) INFORMATION FOR SEQ ID NO : 4 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 507 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4: 



GGCAAGTGGA 
AAAAATATCC 
CCTTTGTAGT 
GTTGGCACCT 
GAAAACTCTG 
GCTGGGACCA 
GTGACATCAA 
TGCCTGAAAC 
AAGGACAAAT 



ACCACTGGCT 
CAGATAACTG 
TACTCTGCTA 
TTACCTCTGG 
GGCATTTCTG 
GAGGCTTCTG 
GATAAAGAGC 
CTGTTCTACC 
AAAGCAATGA 



TGGTGGATTT 
TCATGAAGCT 
CTGCCTTCCT 
ACAACATTCT 
TTGAGCACCT 
AAGCTGTGAA 
GGAGGTGGAT 
AATTATAGAT 
ATACATT 



TGCTAGATTT 
GGTAACTATC 
CATCAACAAA 
TCCCTTTATG 
TGTGGAGGGG 
GAAACTGCTG 
GGGGATGGAA 
CAAATGCCCT 



TTCTGATTTT 
TTCCTGCTGG 
GTGCCCCTTC 
GATCCATTAA 
CTAAGGAAGT 
GAGGCGCTAT 
GATGATGCTC 
AAAATGTAGT 



TAAACTCCTG 
TGACCATCAG 
CTGTTGACAA 
AGCTTCTTCT 
GTGTAAATGA 
CACACTTGGT 
CTATCCTCCC 
GACCCGTGAA 



60 
120 
180 
240 
300 
360 
420 
480 
507 



(2) INFORMATION FOR SEQ ID NO : 5 : 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 519 base pairs 

(B) TYPE: nucleic acid 
<C> STRANDEDNESS: single 
(D) TOPOLOGY: linear 



# 
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(xi) SEQUENCE DESCRIPTION: 



GAATTCGAAT TCGAATTCGT GGAACCACTC 
TTTTAAACTC CTGAAAAATA TCCCAGATAA 
TGGTGACCAT CAGCCTTTGT AGTTACTCTG 
TTCCTGTTGA CAAGTTGGCA CCTTTACCTC 
TAAAGCTTCT TCTGAAAACT CTGGGCATTT 
AGTGTGTAAA TGAGCTGGGA CCAGAGGCT? 
TATCACACTT GGTGTGACAT CAAGATAAAG 
CTCCTATCCT CCCTGCCTGA AACCTGTTCT 
ZvGTGACCCGT GAAAAGGACA AATAAAGCAA 



CTC 

c i ;* 




jw\ TTTTGCTAGA TTTTTCTGAT 

gctggtaact atcttcctgc 
,: :t cctcatcaac aaagtgcccc 

. '.T TCTTCCCTTT ATGGATCCAT 
OCA CCTTGTGGAG GGGCTAAGGA 
TGT GAAGAAACTG CTGGAGGCGC 
•VTO GATGGGGATG GAAGATGATG 
GATCAAATGC CCTAAAATGT 



60 
120 
180 
240 
300 
360 
420 
480 
519 



(2) INFORMATION FOR SEO ID 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 68 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNES5 : single 

(D) TOPOLOGY : linear 



(Xi) SEQUENCE DESCRIPTION: SEC ID NO : 6 : 



AGCTCGGAAT TCCGAGCTTG GATCCTCTAG AGOT^CCGCC GACTAGTGAG CTCGTCGACC 



(2) INFORMATION FOR SEQ ZD tn V: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 6 8 base pairs 
(3) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 

AATTAATTCC CGGGTCGACG AGCTCACTAG TCGGCGGCCG CTCTAGAGGA TCCAAGCTCG 
GAATTCCG 

(2) INFORMATION FOR SEQ ID HO : 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
AGCGGATAAC AATTTCACAC AGGA 

(2) INFORMATION FOR SEQ ID i: > . D : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ 10 NO: 9: 
TGTAAAACGA CGGCCAGT 



CGGGAATT 
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INF0RMATION FOR SEQ ID NO: 10: 



• SEQUENCE CHARACTERISTICS: 
"■ LENGTH: 20 base pairs 
.. TYPE: nucleic acid 
U'J STRANDEDNESS ; single 
'■n TOPOLOGY: linear 



SEQUENCE DESCRIPTION: SEQ ID NO:10: 



TGTGG7. 1 " CTAAGGAAGT 

!?.) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 
\A) LENGTH: 20 base pairs 
(3) TYPE: nucleic acid 
(C) STRANDEDNESS: single 
(DJ TOPOLOGY: linear 



ixi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

TCACGGGTCA CTACATTTTA 

(2) INFORMATION FOR SEQ ID NO: 12: 

<i) SEQUENCE CHARACTERISTICS: 
in) LENGTH: 2 0 base pairs 
(U) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
CTGCTACTGC CTTCCTCATC 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(::i) SEQUENCE DESCRIPTION: .SEQ ID NO:13: 
ATCTTCCATC CCCATCCAC 



(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 93 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
ID) TOPOLOGY: linear 

(■■!) MOLECULE TYPE: None 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:14: 
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Met Lys Leu Val Thr lie Phe Leu Leu Val Thr lie Ser Leu Cys Ser 
Tyr Ser Ala Thr Ala Phe Leu lie Asn Lys Val Pro Leu Pro Val Asp 

Lys Leu Ala llo Leu Pro Leu Asp Asn lie Leu Pro Phe Met Asp Pro 

J ^r- an 4b 



Leu Lys Leu Leu Leu Lys Thr Leu Gly He Ser Val Glu His Leu Val 
Glu Gly Leu Arg Lys Cys Val Asn Glu Leu Gly Pro Glu Ala Ser Glu 

65 70 

Ala Val Lys Lys Leu Leu Glu Ala Leu Ser His Leu Val 
85 90 

(2) INFORMATION FOR SEQ ID NO : 15 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 amino acids 

(B) TYPE: amino acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: None 

<xi> SEQUENCE DESCRIPTION: SEQ ID NO:15: 

Asn Lys Val Pro Leu Pro Val Asp Lys Leu Ala Pro Leu Pro Leu Asp 

1 5 10 

Cys 

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: None 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

Asp Asn He Leu Pro Phe Met Asp Pro Leu Lys Leu Cys 
I 5 10 

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 amino acids 

(B) TYPE: amino acid - 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: None 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

Leu Leu Lys Thr Leu Gly He Ser Val Glu His Leu Val Glu Gly Leu 

1 5 10 

Cys 

(2) INFORMATION FOR SEQ ID NO: 16: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 15 amino acids 
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<B) TYPE: amino acid 

(C) STRANDEDNESS : single 

CD) TOPOLOGY: linear 

(ii) MOLECULE TYPE: None 

<xi> SEQUENCE DESCRIPTION: SEQ ID L 

Leu Arg Lys Cys Val Asn Glu Leu Gly Pro <. ; a Ser Glu Ala 



(2) INFORMATION FOR SEQ ID NO: IS 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: None 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
Glu Ala Ser Glu Ala Val Lys Lys Leu Leu G ' Ala Leu Ser His Leu 



1 



5 



10 



15 



1 
Val 



Cys 



5 



10 



IS 
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\ . A method of detecting the presence of a target LU 1 03 polynucleotide 
5 in a it : ■ ^nple, comprising: 

I ;i ) contacting said test sample with at least one LU 1 03-specific 
polynu . l^iuide or complement thereof; and 

( b) detecting the presence of said target LU 1 03 polynucleotide in the test 
sample, wherein said LU103-specific polynucleotide has at least 50% identity to a 
10 polynucleotide selected from the group consisting of SEQUENCE ID NO I, 

SEQl T-NCE ID NO 2, SEQUENCE ID NO 3, SEQUENCE ID NO 4, SEQUENCE 
ID NO 5. and fragments or complements thereof. 

2 . The method of claim 1 , wherein said target LU 103 polynucleotide is 
15 attache 1 a solid phase prior to performing step (a). 

3 . A method for detecting mRNA of LU 103 in a test sample, 
comprising: 

(a) performing reverse transcription with at least one primer in order to 

20 produce cDNA; 

(b) amplifying the cDN A obtained from step (a) using LU 103 
oligonucleotides as sense and antisense primers to obtain LU103 amplicon; and 

(c) detecting the presence of said LU 103 amplicon in the test sample, 
wherein the LU103 oligonucleotides utilized in steps (a) and (b) have at least 50% 

25 identity to a sequence selected from the group consisting of SEQUENCE ID NO 1, 
SEQUENCE ID NO 2, SEQUENCE ID NO 3 t SEQUENCE ID NO 4, SEQUENCE 
ID NO 5. and fragments or complements thereof. 

4 The method of claim 3, wherein said test sample is reacted with a 
30 solid phase prior to performing one of steps (a), (b), or (c). 

5 The method of claim 3, wherein said detection step comprises 
utilizing a detectable label capable of generating a measurable signal. 
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6. A method of detecting a target LU 103 polynucleotide in a test sample 
suspected of containing said target, comprising: 

(a) contacting said test sample with at least one LU103 oligonucleotide as 
5 a sense primer and with at least one LU 1 03 oligonucleotide as an anti-sense primer 

and amplifying to obtain a first stage reaction product; 

(b) contacting said first stage reaction product with at least one other 
LU103 oligonucleotide to obtain a second stage reaction product, with the proviso 
that the other LU103 oligonucleotide is located 3* to the LU103 oligonucleotides 

10 utilized in step (a) and is complementary to said first stage reaction product; and 

(c) detecting said second stage reaction product as an indication of the 
presence of the target LU103 polynucleotide, wherein the LU103 oligonucleotides 
utilized in steps (a) and (b) have at least 50% identity to a sequence selected from the 
group consisting of SEQUENCE ID NO 1 1 SEQUENCE ID NO 2, SEQUENCE ID 

15 NO 3, SEQUENCE ID NO 4, SEQUENCE ID NO 5, and fragments or 
complements thereof. 

7 . The method of claim 6, wherein said test sample is reacted with a 
solid phase prior to performing one of steps (a), (b), or (c). 

20 

8 . The method of claim 6, wherein said detection step comprises 
utilizing a detectable label capable of generating a measurable signal. 

9 . The method of claim 8, wherein said detectable label is reacted to a 
25 solid phase. 

1 0. A test kit useful for detecting LU 103 polynucleotide in a test sample, 
comprising a container containing at least one LU103 polynucleotide having at least 
50% identity to a sequence selected from the group consisting of SEQUENCE ID 

30 NO 1 , SEQUENCE ED NO 2, SEQUENCE ID NO 3, SEQUENCE ID NO 4, 
SEQUENCE ID NO 5. and fragments or complements thereof. 
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11. A purified polynucleotide or i i :gme i thereof derived from an LU103 
gene, wherein said polynucleotide is capable ' sck ivcly hybridizing to the nucleic 
acid of said LU1 03 gene and has at least 5« v." ieni to a sequence selected from 
the group consisting of SEQUENCE ID NO SE UENCE ID NO 2, SEQUENCE 
ID NO 3. SEQUENCE ID NO 4, SEQUKNC ID :0 5, and fragments or 
complements thereof. 

12. The purified polynucleotide of clair 11, wherein said polynucleotide 
is produced by recombinant techniques. 

13. The purified polynucleotide of clain ,11, wherein said polynucleotide 
is produced by synthetic techniques. 

14. The purified polynucleotide of clair. 1 1 , wherein said polynucleotide 
1 5 comprises a sequence encoding at least one L 7 1 03 -pitope. 

15. A recombinant expression system comprising a nucleic acid sequence 
that includes an open reading frame derived from LU 103 operably linked to a control 
sequence compatible with a desired host, wherein said nucleic acid sequence has at 

20 least 50% identity to a sequence selected from the group consisting of SEQUENCE 
rD NO 1 , SEQUENCE ID NO 2, SEQUENCE ID NO 3, SEQUENCE ID NO 4, 
SEQUENCE ID NO 5, and fragments or complem :nts thereof. 

16. A cell transfected with the recombinant expression system of claim 

25 15. 

17. A cell transfected with a nucleic aci ! sequence encoding at least one 
LU103 epitope, wherein said nucleic acid sequeno- is selected from the group 
consisting of SEQUENCE ID NO 1 , SEQUENCE ID NO 2, SEQUENCE ID NO 3, 

30 SEQUENCE ID NO 4, SEQUENCE ID NO \ am fragments or complements 
thereof. 
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18. A composition of matter comprising an LU 103 polynucleotide or 
fragment thereof, wherein said polynucleotide has at least 50% identity to a 
polynucleotide selected from the group consisting of SEQUENCE ID NO K 
SEQUENCE ID NO 2, SEQUENCE ID NO 3, SEQUENCE ID NO 4, SEQUENCE 

5 ID NO 5, and fragments or complements thereof. 

1 9. The test kit of claim 10 further comprising a container with tools 
useful for collection of said sample, wherein the tools are selected from the group 
consisting of lancets, absorbent paper, cloth, swabs and cups. 



20. A gene or fragment thereof which codes for an LU103 protein which 
comprises an amino acid sequence with at least 50% identity with SEQUENCE ID 
NO 14. 



10 



15 



21. A gene or fragment thereof comprising DNA having at least 50% 
identity with SEQUENCE ID NO 4 or SEQUENCE ID NO 5. 
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> 12350% GAATTCGTGG AACCACTGGC TTGGTGGATT TTGCTAGATT TTTCTGATTT 
Consensus GAATTCGTGG AACCACTGGC TTGGTGGATT TTGCTAGATT TTTCTGATTT 

>1235095 TTAAACTCCT GAAAAATATC CCAGATAACT GTCATGAAGC TGGTAACTAT 
Consensus TTAAACTCCT GAAAAATATC CCAGATAACT GTCATGAAGC TGGTAACTAT 

>1235095 C1TCCTGCTG GTGACCATCA GCCTTTGTAG TTACTCTGCT ACTGCCTTCC 
>1235531 TGCT ACTGCCTTCC 

Consensus CTTCCTGCTG GTGACCATCA GCCTTTGTAG TTACTCTGCT ACTGCCTTCC 

>1 235095 TCATCAACAA AGTGCCCCTT CCTGTTGACA AGTTGGCACC TTTACCTCTG 
>1 235531 TCATCAACAA AGTGCCCCTT CCTGTTGACA AGTTGGCACC TTTACCTCTG 
Consensus TCATCAACAA AGTGCCCCTT CCTGTTGACA AGTTGGCACC TTTACCTCTG 

> 1235095 GACAACATTC TTCCCTTTAT GGATCCATTA AAGCTTCTTC TGAAAACTCT 
>i?: ,; i'i3] GACAACATTC TTCCCTTTAT GGATCCATTA AAGCTTCTTC TGAAAACTCT 
Consensus GACAACATTC TTCCCTTTAT GGATCCATTA AAGCTTCTTC TGAAAACTCT 

> I 235095 GGGCATTTCT GTTGAGCAC. 

> 1 235531 GGGCATTTCT GTTGAGCACC TTGTGGAGGG GCTAAGGAAG TGTGTAAATG 



Consensus GGGCATTTCT GTTGAGCACC TTGTGGAGGG GCTAAGGAAG TGTGTAAATG 

>' 235531 AGCTGGGACC AGAGGCTTCT GAAGCTGTGA AGAAACTGCT GGAGGCGCTA 
> 13794 17 AGCTGGGACC AGAGGCTTCT GAAGCTGTGA AG/VV\CTGCT GGAGGCGCTA 
Consensus AGC'RiGGACC AGAGGCTTCV GMGCTGTGA AGAAACTGCT GGAGQCGCIA 

> 1235531 TCACACTTGG TGTGACATCA AGATAAAGAG CGGAGGTGGA TGGGGATGG 
> 1379417 TCACACTTGG TGTGACATCA AGATAAAGAG CGGAGGTGGA TGGGGATGGA 
Consensus TCACACTTGG TGTGACATCA AGATAAAGAG CGGAGGTGGA TGGGGATGGA 

> 137941 7 AGATGATGCT CCTATCCTCC CTGCCTGAAA CCTGTTCTAC CAATTATAGA 
Consensus AGATGATGCT CCTATCCTCC CTGCCTGAAA CCTGTTCTAC CAATTATAGA 

>1379417 TCAAATGCCC TAAAATGTAG TGACCCGTGA AAAGGACAAA TAAAGCAATG 
Consensus TCAAATGCCC TAAAATGTAG TGACCCGTGA AAAGGACAAA TAAAGCAATG 

>1379417 AATACATT 
Consensus AATACATT 



>1379417 



AAGGAAG TGTGTAAATG 



FIG.1 
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